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Preface 


This volume contains a current survey to the end of 2011 of seasonal or calendar 
anomalies by Constantine Dzhabarov and me focusing on the U.S. stock market. 
In addition, there are reprints of various key papers of mine on U.S., Japanese and 
other calendar and fundamental anomalies plus the related topics of strict arbitrage 
and risk arbitrage. 

The reprinted papers are arranged chronologically starting with the early 1987 
paper on the turn-of-the-year effect with Ross Clark along with a description of 
this trade. An update of the turn-of-the-year effect to 2011 is in the Dzhabarov— 
Ziemba survey paper. But papers on specific topics are grouped together. Donald 
Hausch and I in 1990 wrote two papers on pure arbitrage where you construct a 
situation such that you cannot lose and likely can gain. Arbitrage and risk arbitrage 
are discussed for the game of Jai Alai in my 2008 paper with Daniel Lane. Condi- 
tions for pure arbitrage are presented as well as approximations for risk arbitrage 
where one cannot guarantee profits but expected returns or expected utility can be 
large. 

This is followed by the 1995 risk arbitrage paper on Nikkei put warrant mar- 
kets with Julian Shaw and Edward O. Thorp. There we describe a AA’ trade in 
which A is bought long and A’ which is similar but differently priced is shorted. 
Edward O. Thorp won the over $1 million risk adjusted trading contest held by 
Barron’s in 1990 based on this trade. Interestingly, those in Canada who could not 
price properly the over priced put warrants in the end made about $500 million 
because of how much the Japanese stock market dropped, namely about 56% in 
its 1990 move down. All this was predicted by my bond-stock earnings yield crash 
model as the brief note from Wilmott in Chapter 6 indicates. 

Going to Japan in 1988-1989 as the first Yamaichi Visiting Professor of Finance 
was a unique experience. It started me on a path on practical investment manage- 
ment at a high level that merged nicely into my nine years as the main consultant 
to Frank Russell’s research department from 1989 to 1998 when they sold the com- 
pany. While in Japan, I studied land and stock prices a lot as that’s where the 
action was. The stocks had much land holdings and both were used as collateral 
to buy more of the other. Japanese land was the most expensive in the world with 
a square meter in central Tokyo evaluated at nearly $300,000 at the peak. Doug 
Stone and I wrote the Journal of Economic Perspectives article describing this. I 
recall the editor, the great Joseph Stiglitz, sending me nine single spaced pages of 
suggestions of which we followed up enough to get the paper published. In the end, 
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when the Japanese stock market crashed in 1990 and land started to fall in 1991, 
there was about $5 trillion lost in land and another $5 trillion lost in stocks. Too 
bad that the Japanese did not invest more wisely. They simply bought what they 
already owned, namely their own land and stocks. Despite lots of overpriced pur- 
chases, see Ziemba and Schwartz (1992) for lists, they only had 3% of their assets 
invested abroad. Japanese stock market anomalies are studied in the 1991 Japan and 
the World Economy paper. My late friend, the twentieth century’s most important 
economist, Paul Samuelson, wrote to me saying that the Japanese stock market was 
held together by chewing gum. He was right as it did collapse. But he also felt it was 
not based on the economics. However, I found that basically all the U.S. anomalies 
were there but on slightly different dates. For example, the turn-of-the-month that 
in the U.S. was on days —l to +4 was on —5 to +2 in Japan. The reason being that 
salaries and sales pushes started then. This paper has a comprehensive analysis of 
Japanese anomalies paralleling the U.S. research. 

Cappoza and Ziemba’s 1993 paper discusses the design and some trading expe- 
rience concerning hedge funds based on anomalies. 

There are three 1994 papers. My late Finnish friend Teppo Martikainen, Jukka 
Perttunen and I studied the turn-of-the-month effect during the period January 
1988 to January 1990 in various markets across the world. The effect was working 
then and its reasons are related to new money coming into the market on the —1 
or +1 days. See also the Russell research report in Chapter 6 and the newspaper 
clippings there concerning the turn-of-the-month effect. Chris Hensel, another late 
colleague from Frank Russell and I studied some anomalies focussing on the turn-of- 
the-month effect over the long period 1928-1993 for the U.S. Chris Hensel, Gordon 
Sick and I in 1993 studied the turn-of-the-month effect the U.S. stock index futures 
markets from 1982-1992. Chapter 6 also has a Frank Russell research report Chris 
Hensel and I wrote in 1995 with European, North American, Pacific and worldwide 
January barometer results. 

The European Journal of Operational Research survey paper I wrote is broader 
and covers fundamental as well as seasonal anomalies. A monthly model we made 
in Japan to rank all the Tokyo first section stocks from 1 to N using thirty fun- 
damental factors is discussed. This follows in spirit the Jacobs and Levy (1988) 
model for the U.S. that started their very successful investment management firm. 
The top seven factors future projected earnings over price which is a forward look- 
ing PE ratio, the ordinary trailing PE ration, small cap and two mean-reversion 
factors. With monthly revisions and yearly re-estimation, the model performed 
well. I consulted ith Buchanan Securities in London on a similar model. They 
bought our book, Ziemba and Schwartz (1991), and discovered that in the mid- 
dle of the book was a better factor model than what they were using. The model 
from Japan was estimated with rising stock prices but, slightly modified, worked 
well in a declining stock market. Buchanan was in the business of going long cheap 
warrants and shorting stocks and the model helped their selection process. The 
European Journal of Operational Research paper discusses seasonally effects such 
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as the turn-of-the-month, holiday, the January barometer, time-of-day, the Value 
Line enigma and other topics. 

My 2003 monograph on the Stochastic Programming Approach to Asset-Liability 
and Wealth Management published by Association for Investment Management 
and Research (AIMR) studies the 2000-2002 U.S. stock market crash among other 
topics. This uses the bond stock earnings yield model first presented in the 1991 
book Invest Japan. The model was discovered from the 1987 crash and predicted 
many crashes denned as a fall of 10%+ within a period of time about 4-12 months, 
usually from the start. The model called this internet bubble crash. In the 2005 
paper, Finnish colleagues Matti Koivu and Teemu Pennanen and I study the co- 
integration properties of the bond stock earnings yield differential (BSEYD) model 
focusing on the U.S., U.K. and Germany. The 2008 paper with Klaus Berge and 
Giorgio Consigli shows that if you stay in the stock market when the BSEYD model 
is not in the danger zone and out of the market when it is gives about double the 
final wealth that buy and hold provides with lower risk. See Lleo and Zimba (2012) 
to 2006-2009 predictions. 

Anomalies and behavioral biases are highly related and both yield advantages 
for the investor. In horse race markets, the most important behavioral bias is the 
tendency for the best, i.e., lowest odds horses to be under bet and the longshots over 
bet. Indeed, longshots at 100-1 have true value about 700-1. This has been known 
bookmakers for at least 100 years. In the 2008 Handbook on Sports and Lottery 
Markets, which I edited with my long time racing research colleague Donald B. 
Hausch, we study this favorite-longshot and other biases. We have used such ideas 
in various betting strategies discussed in our other books such as Ziemba and Hausch 
(1984, 1986, 1987, and 2008) and Hausch, Lo and Ziemba (1994, 2008). My paper 
on the efficiency of racing, sports and lottery betting markets surveys such biases in 
many markets. Included are favorite-longshot graphs for horse race betting markets 
and how they evolved over time, football, basketball efficiency and discussions of 
our weak form inefficiency in racetrack place and show markets as well as lottery 
applications including inefficiencies with unpopular numbers. 

In my 2008 paper with Robert Tompkins and Stewart Hodges, we show that 
S&P500 futures options display biases similar to the favorite-longshot bias. This 
yields ideas for hedge fund trading strategies that I have used extensively. 

Finally, my 2008 paper with Marshall Gramm discusses a very interesting race- 
track anomaly. This concerns the Kentucky Derby, Belmont Stakes and other races. 
In these two races, the horses have never run the distance they are about to run. 
So maybe their breeding will forecast if they have enough speed and stamina to 
win these tough races. This anomaly, like many others, has changed over time but 
still seems to be there in some form. Rod Bain, Donald Hausch and I, in our 2006 
paper, look deeper into the dosage theory applied to the Kentucky Derby during 
1981-2005, showing the great advantage of this anomaly. 

Thanks go to Constantine Dzhabarov for working with me the last few years on 
stock market anomalies that led the to the main paper of this book. Discussions with 


xiv Calendar Anomalies and Arbitrage 


the great investors Edward O. Thorp and Blair Hull as well as my many authors 
listed above have been helpful over the years. Finally, my wife, Sandra Schwartz, 
helps me in a myriad of ways as a sounding board, critique and producer of the 
books. 


William T. Ziemba 
Vancouver, January 2012 
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Chapter p 
Introduction — Calendar Anomalies! 


Constantine S. Dzhabarov 


Alpha Lake Financial Analytics Corp, Canada 


William T. Ziemba 


University of British Columbia, Canada 
ICMA Centre, University of Reading, UK 


This chapter is a survey of seasonal anomalies. Ziemba has been involved in the 
research and trading of such anomalies as the January turn-of-the-year effect since 
1982. His research plus that of other academics plus the very useful practitioner 
research of Yale Hirsch’s Stock Trader’s Almanac starting in 1972 is reviewed. (We 
academics reference Hirsch but the Hirsches operate in a closed economy, not ref- 
erencing others.) The discussion begins with an assessment of why the seasonal 
anomalies are so controversial but valuable and discusses some survey papers and 
books. Then beginning with the seminal anomaly, the January small firm effect, 
we discuss various other anomalies and their use in strategies including the con- 
struction of seasonality calendars that rank the various trading days of the year. 


1.1 Introduction to Seasonal Anomaly Effects 


Seasonality of stock markets has a long history despite the academic research being 
dominated by efficient market theory as surveyed by Fama (1970, 1991). Small firm 
effects were popularized by University of Chicago students Banz (1981), Reinganum 
(1981), Blume and Stambaugh (1983), Roll (1983), and Ritter (1988) among others. 


1 Dedicated to the memory of Merton H. Miller, Ziemba’s co-host in 1996 at the Graduate School 
of Business, University of Chicago, and to the memory of Chris Hensel, Ziemba’s co-author of 
many anomaly papers at the Frank Russell Company and University of Chicago MBA. It was 
there in the early 1980’s when Banz, Blume, Keim, Reinganum, Ritter, Roll, Stambaugh and 
other students at the most strongly efficient market oriented US finance department published 
small stock market anomalies papers in top finance journals and opened up the area. Miller, a 
strong efficient market academic but also savvy practical student of the markets used to tell me: 
“The half life of an anomaly is three years.” Ziemba’s experience since 1982, when he first traded 
the turn-of-the-year effect in the futures markets, is that when markets are regular (not too high 
volatility), the anomalies tend to work. But each year or play usually is slightly different and may 
move around. So constant research and careful risk control is important in using these results in 
trading. The true test is can you use them to make excess risk adjusted profits and Ziemba believes 
this to be the case. This monograph also updates some of the results from Dzhabarov and Ziemba 
(2010, 2011) and various anomaly papers Ziemba has published. 


2 Calendar Anomalies and Arbitrage 


Early surveys are in Lakonishok and Smidt(1988), Thaler (1992) and Ziemba 
(1994). The latter references considerable regularity of various seasonal anomalies 
in Japan as well as in the U.S. Jacobs and Levy (1988abc) have used seasonal 
and fundamental factor model derived anomalies to create a multibillion dollar 
investment firm. Dimson (1988) and Keim and Ziemba (2000) present whole books 
with studies across the world. The Stock Traders Almanac discusses some such 
anomalies in yearly updates; see Hirsch and Hirsch (2011). 

Anomalies of the seasonal variety as discussed in this chapter and those based 
on fundamental and other factors in the rest of this book, and in Keim and Ziemba 
(2000) and Zacks (2011) are not fully accepted nor believed by many strong efficient 
market theorists. Part of this dismissal is that the anomalies are too small to be 
bothered with as Ross (2005) argues. So, more or less, does Fama (1970, 1991).The 
great financial empiricist Roll(1994) makes the startling statement that even with 
considerable resources, he has never been able to find a profitable anomaly. The 
well-known book Malkiel (2011) even states that strong effects like the January 
effect do not exist. Marquering, Nisser and Valle (2006) argue that the anomalies 
disappear after they are published, although some reappear; see also Dimson and 
Marsh (1999) and Schwert (2003). Hudson, Keasey and Littler (2002) and Lucey 
and Pardo (2005) discuss how anomalies are affected by papers published on them. 

There also is the serious issue of data mining. Indeed, many results are in-sample 
and true tests out-of-sample. Statistical verification of the actual existence of signif- 
icant seasonal anomaly effects is studied by Sullivan, Timmerman and White (1999) 
who analyze 9452 calendar based trading rules. See also Hansen, Lunde and Nason 
(2005) who study 181 calendar effects and Lo and MacKinley (1990) who discuss 
data snooping biases. Also t values tend not to show statistical significance in many 
cases where successful trades have been made because of high standard deviations. 

Rather than debate such people, Ziemba and Ziemba (2007) simply argued that 
there are five basic stock market camps. Each has a cut or version of certain sections 
of the market and makes its point for a certain subset of market participants, 
instruments and strategies. There may be other classifications but these provide a 
useful framework for discussion. 


The Five Groups are: 

1. Efficient markets (E) 

2. Risk premium (RP) 

3. Genius (G) 

4. Hog wash (H) 

5. Markets are beatable (A) 


The first group are those who believe in efficient markets (E). They believe that 
current prices are fair and correct except possibly for transactions costs. These 
transaction costs, which include commissions, bid-ask spread, and price pressures, 
can be very large.” 


2A BARRA study by Andy Rudd some years ago showed that these costs averaged 4.6% one-way 
for a $50,000 institutional investor sale of small cap stock. This is if you use a naive market order 
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The leader of this school which had dominated academic journals, jobs, fame, 
etc. in the 1960s to the 1980s was Eugene Fama of the Booth School of Business, 
University of Chicago. A brilliant researcher, Fama is also a tape recorder: you 
can turn him on or off, you can fast forward or rewind him or change his volume, 
but you cannot change his views no matter what evidence you provide; he will 
refute it forcibly. In the aggregate there is much to support his case. One such 
example is the $300+ million gift from former student David G. Booth to name 
the Chicago Graduate School of Business earned from fees from his index fund 
firm Dimension Fund Advisors (DFA) founded with another Chicago student Rex 
Sinquefield. Booth never got his Ph.D. but Fama helped him get a job and later in 
1981 DFA was founded with Fama as a key advisor. Some multi-billion plus later 
in fees shows how low fees with simple strategies can add up with huge volume. 
Booth has a co-authored paper with Donald Keim on the January effect in the cash 
market in the Keim and Ziemba (2000) anomalies book. 

This group provided many useful concepts such as the capital asset pricing model 
of Sharpe (1964), Lintner (1965) and Mossin (1966), which provided a theoretical 
justification for index funds, which are the efficient market camp’s favored invest- 
ment mode. They still beat about 75% of active managers. Since all the managers 
comprise the market, that’s 50% of them beaten by the index. Transactions (com- 
mission plus market impact) such as exchange taxes, bid-ask spread and other costs 
eliminate another 25%. See Ziemba and Schwartz (1991:44—46) for examples of how 
few funds beat the index across the world. In a sample of 167 funds, only 48 (28.7%) 
beat the benchmark. 

Over time the hard efficient market line has softened into a Risk Premium (RP) 
camp. They feel that markets are basically efficient but one can realize extra return 
by bearing additional risk. They strongly argue that, if returns are above average, 
the risk must be there somewhere; you simply cannot get higher returns without 
bearing additional risk. For example, beating the market index S&P500 is possible 
but not risk adjusted by the capital asset pricing model (CAPM). They measure 
risk by Beta, which must be greater than one to receive higher than market returns. 
That is, the portfolio risk is higher than the market risk. But they allow other risk 
factors such as small cap and low book to price. But they do not believe in full 
blown 20-30 factor models such as used by Jacobs and Levy (1988) for the U.S. 
and Schwartz and Ziemba (2000) for Japan. Rather they prefer to use just a few 
factors and small cap and price to book value are favorites. Ziemba recalls Barr 
Rosenberg focusing on small cap and low price to book as the key factors in 1967, 
see Rosenberg, Reid and Lanstein (1985). Later, Fama and French (1992) took the 
credit for these ideas with a more complete study. Fama and his many disciples 
moved to this camp in the 1990s. This camp now dominates the top U.S. academic 


for the full transaction rather than limit orders or smaller market orders. Thorp, in a private com- 
munication, told Ziemba that he traded about $60 billion in statistical arbitrage from 1992-2002 in 
lot sizes of 20K to 100K and found that the mean transaction cost was about 1 cent per share and 
the market impact was about 4.5 cents per share for shares averaging about $30. So the one-way 
costs were about 0.18%. 


4 Calendar Anomalies and Arbitrage 


journals and the jobs in academic finance departments at the most famous business 
schools in the U.S. and Europe. 

The third camp is called Genius (G). These are superior investors who are bril- 
liant or geniuses but you cannot determine in advance who they are. The late MIT 
economist Paul Samuelson championed this argument. Samuelson felt that these 
superior investors do exist but it is useless to try to find them as in the search for 
them you will find 19 duds for every star. Surprisingly, Samuelson was an early 
investor in the very successful futures trading operation Commodity Corporation 
run, partly, by one of his MIT students. This view is very close to the Merton— 
Samuelson criticism of the Kelly criterion: that is, even with an advantage, it is 
possible to lose a lot of your wealth. See MacLean et al. (2011) for simulations 
discussing this point and Thorp and Ziemba’s (2012) response to private letters 
received from Samuelson. The evidence though is that you can determine some 
superior investors ex ante and to some extent they have persistent superior per- 
formance, see Fung et al. (2006), Jagannathan et al. (2006), Ziemba (2005) and 
Gergaud and Ziemba (2012). Soros did this with futures with superior picking of 
commodities and currencies to bet on; this is the traders are made not born philos- 
ophy. This camp will isolate members of other camps such as in (A) or (H). 

The forth camp is as strict in its views as camps (E) and (RP). This group feels 
that efficient markets that originated in and is perpetuated by the academic world 
is hogwash (H). In fact the leading proponent of this view, and one with whom 
it is hard to argue as he tops the lists of richest people and greatest investors, is 
Warren Buffett, who wants to give university chairs in efficient markets to further 
improve his own very successful trading. An early member of this group, the great 
economist John Maynard Keynes was an academic. We see also that although they 
may never have heard of the Kelly criterion, this camp does seem to use it implicitly 
with large bets on favorable investments. See MacLean, Thorp and Ziemba (2011). 
Ziemba and MacLean (2011) present a table of actual asset positions in a George 
Soros fund where about half the portfolio is invested in one position and equity 
weights near 10% for several positions are in Warren Buffett’s Berkshire Hathaway. 
They do not care about monthly losses, and have many, but they focus instead on 
high long term wealth growth. So they resemble Kelly bettors. 

This group feels that by evaluating companies and buying them when their value 
is greater than their price, you can easily beat the market by taking a long-term 
view. They find these stocks and hold them forever. They find a few such stocks 
that they understand well and get involved in managing them or they simply buy 
them and make them subsidiaries with the previous owners running the business. 
They forget about diversification because they try to buy only winners. They also 
bet on insurance when the odds are greatly in their favor. They well understand 
tail risk which they only take at huge advantages to themselves when the bet is 
small relative to their wealth levels. Indeed Buffett’s long-term approximately 15- 
18 year bet shorting S&P500 over-the-counter puts, for which he received much 
incorrect criticism, sure looks good now with the S&P500 closing 2011 at 1257.60 
and in the 1400 area in April 2012. The odds favor him not to have to pay back 
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the $4.6 billion premium he is now using for trades like the 10% loans to GE and 
Goldman Sachs. Calculations indicate that Buffett collected about twice the fair 
value of these American put options that cannot be exercised until expiry many 
years in the future. 

The last group are those who think that markets are beatable (A) through 
behavioral biases, security market anomalies and other research using computerized 
superior betting techniques. They construct risk arbitrage situations with positive 
expectation. They research the strategy well and follow it for long periods of time 
repeating the advantage many times. They feel that factor models are useful most 
but not all of the time and show that beta is not one of the most important variables 
to predict stock prices. They use very focused, disciplined, well researched strate- 
gies with superior execution and risk control. Many of them use Kelly or fractional 
Kelly strategies. All of them extensively use computers. They focus on not losing, 
and they rarely have blowouts. Members of (A) include Edward O. Thorp (Princeton 
Newport and later funds), Bill Benter (the Hong Kong racing guru), John Henry 
(the Red Sox owner), Blair Hull (the mispricing of options guru in Chicago), Harry 
McPike (a trend follower), Jim Simons (Renaissance hedge fund), Peter Muller (of 
Morgan Stanley) Jeff Yass (Susquehanna Group) and David Swensen (who runs 
the Yale University endowment). Most of these Ziemba has had dealings with or 
consulted for. This group has made many billions trading. Blowouts occur more in 
hedge funds that do not focus on not losing and true diversification and over-bet; 
when a bad scenario hits them, they get wiped out, such as Long-Term Capital Man- 
agement (LTCM), Niederhofer, and Amarath, etc.; see Chapters 11-13 of Ziemba 
and Ziemba (2007). 

So we will proceed assuming we are in category A and look at the data, possible 
explanations and some trading results. Statistical significance is frequently an issue 
even in long sequences of successful trades because of high standard deviations. 


1.2 January Effect 


We refer to the January effect as the tendency of small cap stocks to outperform 
large cap stocks in the month of January. Rozeff and Kinney (1976) showed that 
equally weighted indices of all the stocks on the New York Stock Exchange (NYSE) 
had significantly higher returns in January than in the other eleven months during 
1904-1974. Keim (1983) documented the magnitude of the size effect by month using 
1963-1979 data. He found that half the annual size premium was in January. Blume 
and Stambaugh (1983) showed that, after correcting for an upward bias in mean 
returns for small stocks that was common to earlier size effect studies, the size effect 
was only in January. Figure 1.1 shows the historical evidence from January 1926 to 
December 1995 of the difference in January between the lowest decile and the highest 
decile by market capitalization of the NYSE index plus American Stock Exchange 
(AMEX) and Nasdaq stocks of similar size. Only five years out of 70 did small 
caps underperform in January and in most years, the small cap outperformance is 
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R(10th Declle) - R(1st Declle) (Percent) 
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January Only 


Figure 1.1: January Effect, 1926-1995. January Size Premium = R(10'") — R(1*). 
Source: Booth and Keim (2000) 
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Figure 1.2: Russell 2000 — S&P500 Futures Spread Average Returns During Various Months, 
1993-2011. 


considerable. The Rigin — Rise decile returns averaged 4.48% with a t = 2.83 from 
January 1982 to December 1995. 

To update, we calculated the Russell 2000/S&P500 futures spread by month 
from 1993 to December 2011. All data in this chapter is updated to the end of 
2011. As argued by Rendon and Ziemba (2007), the January TOY effect still exists 
but has moved to December. Indeed, Figure 1.2 shows that the small cap/large cap 
spread is positive in December and negative in January. 
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Figure 1.3: S&P500 Futures Average Monthly Returns. 


The January monthly effect for small and large cap stocks measured by 
the Russell 2000 and S&P500 futures has been negative during January 1993 — 
December 2011 and January 2004 — December 2011. Figures 1.3ab and 1.4ab show 
the results with the data in Tables A.2 and A.3 in the Appendix. The results show 
the historically expected very negative October in the recent S&P500 data and 
in both sets of Russell 2000 data. Surprisingly, the historically strong months of 
November, January and February were negative for both the small and large cap 
data recently. While most of the other seasonality effects have still produced valuable 
reliable anomalies, the monthly effect does not look to be of much use for traders 
and investors. But sell in May and go away, discussed below, does add value. 
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Figure 1.4: Russell 2000 Futures Average Monthly Returns. 


Several subsequent analyses built on Keim’s study and considered the possibility 
that the January effect was diminishing based on the inclusion of later years of data, 
but Easterday, Sen and Stephan (2008) also expanded their study to include years 
before Keim’s analysis, which allowed them to better assess trends in the January 
Effects magnitude. They included the years from 1946-2007, performing a time 
series analysis according to the three sub-periods in relation to Keim’s 1963-1979 
window: before, during, and after. Over this period, they studied NYSE and AMEX 
firms and, from 1971 onwards, they also considered NASDAQ firms, which allowed 
them to consider more small cap stocks in their analysis. 
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Contrary to studies based on the Keim period and later years, Easterday et al. 
do not conclude that the January effect is declining. In other words, they do not find 
evidence that investors are acting on the arbitrage opportunity and internalizing it 
into higher prices. Instead, they find that the January effect continues to be robust 
in small firms and that, in recent years, it has not so much diminished as returned 
to a level similar to the effect exhibited prior to 1963. Easterday et al. also con- 
sidered trading volume in January, which should be higher if investors are actively 
arbitraging the January effect opportunity, but they did not find any evidence of 
higher trading volumes. 

Haug and Hirschey (2006) also extensively analyzed the January effect, using 
both value-weighted and equal-weighted equity returns. Their findings concur with 
Easterday and particularly note the consistency of the January effect in small cap- 
italization stock returns across time. For instance, they find that the difference in 
average mean value-weighted portfolio return is 0.40% from 1802-2004, and that 
this number is even greater, 0.61%, from 1952-1986 (roughly the period Keim stud- 
ied expanded two-fold). 

Haug and Hirschey explore potential explanations of the January effect phe- 
nomenon, ruling out biases that would more markedly affect large capitalization 
stocks, such as the timing considerations of institutional investors during portfolio 
rebalancing around official reporting periods. Statistical arguments brought up by 
Sullivan, Timmerman and White (1999), among others, center around the inher- 
ent statistical problem of testing an empirical aspect of a data set using the same 
data set, which fundamentally calls into question the underlying statistical methods 
used in the analysis. Additionally, two theories concerning relatively small investors 
concern end-of-year tax considerations or income events, such as year-end bonuses, 
which lead to new purchases in the new year. However, both of these potential expla- 
nations come into question when considering international indices under different 
tax regime timing and across changes in tax laws that should have an effect. 

Among other measures, Haug and Hirschey use the Tax Reform Act of 1986 to 
test these and other behavioral hypotheses as potential explanations of the January 
effect, but they reach contradicting conclusions using different data, namely value- 
weighted and equal-weighted returns. They ultimately conclude that each of these 
explanations remain potential but unproven drivers of the still perplexing January 
effect phenomenon. 


1.2.1 Trading the January small cap effect in the futures markets 


The evidence suggests that small stocks outperform large stocks at the turn of the 
year. Yet, transactions costs, particularly bid-ask spreads and price pressures, take 
away most, if not all, of the potential gains. See e.g., Stoll and Whaley (1983) and 
Keim (1989) on these effects in January. However, transaction costs to trade index 
futures are a tenth or less of those for the corresponding basket of securities, and 
even more important, there is much less market impact. Hence, it may be profitable 
to buy long positions in small stock index futures and sell short positions in large 
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stock index futures. This pair of positions is known as a spread trade. The strategy 
must anticipate the effect in the market place, in particular, the price impact of 
buying and selling futures contracts. Stock index futures began trading in the U.S. 
in May 1982. The Value Line minus S&P500 and Russell 2000 minus S&P500 spreads 
are two ways to measure and possibly capture any advantage small stocks may have 
over large cap stocks. 

Ziemba started doing this trade for the 1982/1983 TOY with Vancouver com- 
modity broker Ross Clark. Clark and Ziemba (1987) describes these markets and 
the Value Line/S&P spread. See also the synopsis of this trade in Chapter 6. At 
that time the Value Line, which had about 1700 stocks was geometrically weighted. 
Hence, by the geometric-arithmetic inequality this produced a downward drift of 
about i% per month. After 1988 the index became price weighted arithmetic. Clark 
and Ziemba used the following trading rule: 


buy the spread on the first closing uptick, starting on December 15 and 
definitely by the 17**, and sell on January 15. Waiting (to enter) until 
(—1) now seems to be too late: possibly finance professors and their col- 
leagues, as well as other students of the turn-of-the-year/January effect 
who are in on the strategy, move the VL index. There seems to be a bid- 
ding up of the March VL future price relative to the spot price. (Clark 
and Ziemba, 1987:805) 


Their idea at that time was that the January small firm effect existed and occurred 
during the first two weeks of January in the cash market (as argued by Ritter, 1988; 
see also comments by Ziemba, 1988), but that futures anticipation would move the 
effect in the futures markets into December. Hence, an entrance into the Value 
Line/S&P500 futures spread trade in mid-December and an exit in mid January 
should capture the effect if it actually existed. With data up to the 1985/1986 TOY, 
their trade rule was successful. They concluded that small cap advantage was mainly 
in the first half of January, with some anticipation in the final days of December, 
and with a large cap advantage in the second half of January. 

Ziemba continued trading this spread for the 14 TOY’s (all winners) from 1982- 
1983 to 1995-1996 and updated the results in Ziemba (1994) and Hensel and Ziemba 
(2000). 

Hensel and Ziemba (2000) analyzed the January effect in the futures markets 
and concluded that for the 1980s and early 1990s there was a small cap advantage 
in the futures and cash markets. However, they show that from 1994 to 1998 there 
was no advantage in the cash market, and that anticipation built up in the last half 
of December in the futures markets. As a consequence, for the four TOYs during 
the 1994-1998 period, the January effect only existed in the last half of December, 
in the futures market. They analyzed small minus large spread trades between the 
Value Line and the S&P500 futures contracts and concluded that the January effect 
was exploitable in the futures markets in this period. 

Rendon and Ziemba (2007) updated Hensel and Ziemba (2000) to analyze the 
seven TOYs from 1998-1999 to 2004-2005 for the Value Line minus S&P500 spread 
trade, and provided additional evidence by analyzing a second spread trade involving 
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the Russell 2000 and the S&P500 futures contracts. From 1998 to 2005, their analysis 
shows that the January effect is still present in the futures markets in the Value 
Line minus S&P500 spread trade, but that it has become increasingly risky to try 
to exploit it because of the marginal liquidity of the Value Line stock index futures 
contract. For the Russell 2000 minus S&P500 spread trade, the January effect has 
been profitable. 

Rendon and Ziemba (2007) investigated the day by day cash and futures returns 
of the spread trades: Value Line minus S&P500 and Russell 2000 trading, as well as 
for the 12 TOYs for which there was Russell 2000 futures trading up to 2004/2005. 

Rendon and Ziemba (2007) computed the cash index and March futures spreads 
(Value Line 500 minus S&P500, Value Line 100 minus S&P500, and Russell 2000 
minus S&P500) day by day in December and January. The 500 and 100 refer to the 
contract size, $500 or $100 per point, respectively. The spread between the futures 
difference and the cash difference represents the futures anticipation. Table 1 in 
Rendon and Ziemba (2007) (not reprinted here) summarizes, for each spread, the 
anticipation or lack of it in the second half of December, during the TOY (trading 
days —1 to +4), the rest of the first half of January and the second half of January 
for all the TOYs. Some years were easy and others hard to make profits. Figure 1.5 
shows eight typical turn-of-years plots for December and January: 


1. A typical trade for 1999/2000 showing the futures and cash spreads (TOY) for 
the Value Line 500 minus S&P($500 value). The dotted line is the futures spread 
and the dark line is the cash spread. In this case you could enter at a discount 
in mid December. The trade gained but you had to cash out in mid January at 
a discount. 

2. The 1997-1998 TOM also for the VL500/S&P500 spread had all the advantage 
in December then the spread declined all January. You had to buy at a premium 
but you could sell for fair value. If you did not sell at the end of December you 
would have lost some or all of your gains. Traders seeing such a trend in January 
would likely have cashed out. 

3. The VL100/S&P500 for 2003-2004. The TOM had virtually no futures spread 
discounts or premiums. Since the spread increased continuously from 
December 17 traders easily made profits. 

4. The TOM for 2000-2001 for the VL100/S&P500 value-adjusted for the cash and 
March futures. The spread with some volatility increased throughout December 
and January so the trade made profits regardless of when it was entered and 
exited. There were premiums or discounts throughout December and January. 

5. In 1994-1995 for the R2000/S&P500 there were large gains in December and the 
spread peaked at the end of December. Those who did not cash out then lost 
some of their gains in January. 

6. In 1993-1994 for the R2000/S&P500 spread you had to buy at a premium. Then 
the spread fell in December and non-true believers cashed out with losses but 
believers won when the spread gained in January and traders likely could have 
traded out at a futures spread discount. 
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7. The TOY for 2004-2005 for the R2000/S&P500 spread won in December and 
you could have bought the spread at a discount. But if you held the spread into 
January you would have given back some or all of your gains or even lost money. 

8. In 2001-2002 you had to buy the R2000/S&P500 spread at a premium but the 
trend was up until mid January. So traders made profits. 


The anticipation year by year patterns vary greatly. For the Value Line 500 
minus S&P500 spread, Hensel and Ziemba (2000) reported an apparent shift from 
an anticipation in the first half of January in the early years, to an anticipation 
during the second half of December in the late 90s, as well as a lack of small cap 
anticipation past the TOY, in January. For this spread trade, our results show little 
or no anticipation in the contracts last two years of existence. This was, most likely, 
related to the scarce liquidity of the contract in its final months. 

For the Value Line 100 minus S&P500 spread there is a positive anticipation dur- 
ing the second half of December to the TOY period. Further from this point, the 
results are very mixed, with an apparent dominance of positive anticipation, but 
with a clear trend towards no anticipation, especially in the 2003-2004 TOY, where 
there was little or no anticipation during the first half of December, and no antici- 
pation at all in the rest of the analyzed period. This is, again, consistent with the 
diminishing liquidity of the Value Line 100 contract, which reached extreme values 
in the 2004-2005 TOY, with an average of less than ten contracts traded daily and 
an average open interest of less than one hundred contracts. 

For the Russell 2000 minus S&P500 spread, positive anticipation dominates in 
all the time intervals for which we divided the TOY period. In the three TOYs, 
2002-2003, 2003-2004, and 2004-2005, negative anticipation dominated, but its 
size is not significant. Since 2000, the pattern seems to be changing from positive 
anticipation throughout the period to none or insignificant negative anticipation. 

For the Value Line 500 minus S&P500 spread, the three TOYs up to the ter- 
mination of this higher-multiplier contract in March 2000, showed a reversion to 
the original January effect. The declining liquidity of the contract up to its date of 
termination is clearly a factor in this reversal. This made the trade very risky and 
volatile. Large caps had the advantage in the second half of December and January, 
but small caps outperformed large caps during the (—1) to January 15 period. On 
average, the Clark and Ziemba (1987) rule would have yielded profits over the entire 
1982/1983 to 1999/2000 sample period, although this result is not statistically sig- 
nificant. This trade was successful in all but two of the TOYs in this sample. The 
trade produced statistically significant profits in the 14 TOYs from 1982-1983 to 
1995-1996, with a mean gain of 4.25 spread points, standard deviation of 2.81, a t 
value of 1.51 and 14 of 14 winners. The results are not statistically significant in the 
complete sample period because of losses in the 1996-1997 and 1997-1998 TOYs 
and high variability. 

For the Value Line 100 minus S&P500 spread, the results show that there was 
still a small cap advantage in the second half of December, followed by a large cap 
advantage in January. On average, this trade continued to be profitable since it 
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produced positive profits in all but one of the 7 TOYs studied. However the results 
were not statistically significant because of high variability. Moreover, as pointed 
out by Hensel and Ziemba (2000), the trade became increasingly riskier and volatile 
as the liquidity of the Value Line futures contract diminished. 

The Russell 2000 minus S&P500 trade is very interesting. There seems to be a 
clear pattern of anticipation of the January effect on this spread. The results show 
a definite large cap advantage in the second half of January, as it was in the earlier 
Value Line minus S&P500 studies. For the rest of the sub periods, the results are 
as follows: Out of 12 TOYs, large caps had the advantage in eight years during 
the (—1) to January 15 period. On the other hand, small caps had an advantage 
in 9 out of the 12 TOYs for the December 15 to (—1) period. On average, the 
Clark and Ziemba (1987) rule produced positive profits, although this result is not 
statistically significant. A special case is the December 15 to (—3) period illustrated 
in Table 1.1. In this case, the spread trade produces profits that are close to being 
statistically significant at the 10% level. This result suggests that the Hensel and 
Ziemba (2000) modification to the Clark and Ziemba (1987) rule for the Value Line 
vs. S&P500 spread trade should be modified so as to unwind the position at the (—3) 
day of the TOY. The 1999-2000 and 2000-2001 TOYs in the Russell 2000 sample 
were particularly strong, and introduce high variability in the sample. Tables 1.1 
and 1.2 show the mean gain, standard deviation and t statistic for the spread trade 


Table 1.1: Results From Russell 2000/S&P500 March Futures Spread Trades in Index Points on 
Various Buy/Sell Dates for the 18 Turn-of-the-Years 1993-1994 to 2010-2011. 


Difference Difference Difference Trade Gain Trade 
Dec 15 (—1) to Jan 15 to Dec 15 to Weights 

TOY to (—3) Jan 15 end Jan Jan 15 S&P vs. R2 
1993-1994 3.39 (2.22) 1.84 1.17 0.52 v 0.48 
1994-1995 8.03 (3.93) (5.49) 4.10 0.52 v 0.48 
1995-1996 5.26 (7.68) (2.96) (2.42) 0.51 v 0.49 
1996-1997 2.84 (6.02) (7.20) (3.18) 0.51 v 0.49 
1997-1998 6.12 (8.47) (6.52) (2.35) 0.53 v 0.47 
1998-1999 12.75 2.78 (15.33) 15.53 0.59 v 0.41 
1999-2000 23.85 9.17 12.30 33.02 0.61 v 0.39 
2000-2001 22.94 8.07 8.98 31.01 0.59 v 0.41 
2001-2002 2.92 (3.29) 5.15 (0.37) 0.45 v 0.55 
2002-2003 2.78 (6.27) 5.72 (3.49) 0.47 v 0.53 
2003-2004 2.17 17.69 (4.30) 19.86 0.51 v 0.49 
2004-2005 0.57 (19.02 8.12 (18.45) 0.52 v 0.48 
2005-2006 2.72 11.99 28.86 14.71 0.52 v 0.48 
2006-2007 0.36 0.13 1.39 0.23 0.53 v 0.47 
2007-2008 16.48 (24.66 17.77 (8.18) 0.51 v 0.49 
2008-2009 28.18 6.85 (8.62) 21.33 0.51 v 0.49 
2009-2010 14.64 1.20 (2.66) 15.84 0.51 v 0.49 
2010-2011 1.91 1.04 (19.15) 2.95 0.55 v 0.45 
Average 8.77 2.03 0.99 6.74 

StDev 8.86 10.28 11.70 13.85 

t stat 0.99 0.20 0.08 0.49 
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Table 1.2: Results From VL100/S&P500 March Futures Spread Trades in Index Points on Var- 
ious Buy/Sell Dates for the 19 Turn-of-the- Years 1982-1983 to 1999-2000. 


Difference Difference Difference Trade Gain Trade 

Dec 15 (—1) to Jan 15 to Dec 15 to Weights 
TOY to (—1) Jan 15 end Jan Jan 15 S&P vs. VL 
1982-1983 1.50 3.05 0.65 4.55 0.5 v 0.5 
1983-1984 (0.70) 4.00 (4.90) 3.30 0.5 v 0.5 
1984-1985 1.10 3.55 2.90 4.65 0.5 v 0.5 
1985-1986 3.15 0.45 (2.60) 3.60 0.5 v 0.5 
1986-1987 2.75 (0.30) (9.85) 2.45 0.5 v 0.5 
1987-1988 8.15 (0.90) (0.25) 7.25 0.5 v 0.5 
1988-1989 3.50 (2.95) (1.70) 0.55 0.5 v 0.5 
1989-1990 (0.50) 1.85 (1.45) 1.35 0.5 v 0.5 
1990-1991 1.70 3.60 3.20 5.30 0.5 v 0.5 
1991-1992 (7.15) 10.20 13.80 3.05 0.5 v 0.5 
1992-1993 5.45 6.55 4.05 12.00 0.5 v 0.5 
1993-1994 4.65 a 1.15 4.65 0.5 v 0.5 
1994-1995 6.15 (1.65) (8.50) 4.50 0.5 v 0.5 
1995-1996 6.00 (3.75) (9.70) 2.25 0.5 v 0.5 
1996-1997 4.35 (15.90) (10.75) (11.55) 0.5 v 0.5 
1997—1998 10.70 (6.50) (12.30) 4.20 0.5 v 0.5 
1997-1998 1.01 (6.40) (6.42 (5.39) 0.64 v 0.36 
1998-1999 0.91 2.17 (28.19 3.08 0.60 v 0.40 
1999-2000 6.09 14.00 7.33 20.08 0.59 v 0.41 
Average 
1982-1998 3.18 0.08 (2.27 3.26 
Std Dev 4.10 5.90 6.89 4.74 
t stat 0.78 0.01 (0.33 0.69 
Average 
1998-2000** 2.67 3.26 (9.09 5.93 
Std Dev 2.96 10.24 17.91 12.97 
t stat 0.90 0.32 (0.51 0.46 


excluding these years. The January effect could be exploitable in the futures markets 
through the Russell 2000 minus S&P500 spread trade, which presents the additional 
advantage of greater liquidity, when compared to the Value Line vs S&P500 spread 
trade. There are periods for which the Value Line minus S&P500 trade had gains 
and the Russell 2000 minus S&P500 had losses. Although this is not the case in 
the years after 2000, when the trades have started to look very similar, it is not 
contradictory because the contracts are made up of different stocks, and the Russell 
2000 contract has a much higher liquidity. 

Tables 1.2 and 1.4 take data from Hensel and Ziemba (2000) and update the 
results for the Clark and Ziemba (1987) rule for the Value Line 500 minus S&P500 
and Value Line 100 minus S&P500 spread trades. All fourteen of the TOY trades 
from 1982-1983 to 1995-1996 showed profits. Figures 1.5a—h show the anticipa- 
tion, etc. But because of declining Value Line volume and consulting to Morgan 
Stanley in New York where he taught the Peter Muller group about the TOY and 
other trades, that group became very successful and had gains in the $5 billion 
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Table 1.3: Results From Russell 2000/S&P500 March Futures Spread Trades in Index Points 
on Various Buy/Sell Dates for the 12 Turn-of-the-Years 1993-1994 to 2004-2005. 


Difference Difference Difference Trade Gain Trade 
Dec 15 (—1) to Jan 15 to Dec 15 to Weights 

TOY to (—1) Jan 15 end Jan Jan 15 S&P vs. R2 
1993-1994 4.23 (2.22) 1.84 2.01 0.52 v 0.48 
1994-1995 5.84 (3.93) (5.49) 1.91 0.52 v 0.48 
1995-1996 5.28 (7.68) (2.96) (2.39) 0.51 v 0.49 
1996-1997 (1.63) (6.02) (7.20) (7.65) 0.51 v 0.49 
1997—1998 7.43 (8.47) (6.52) (1.04) 0.53 v 0.47 
1998-1999 9.74 2.78 (15.33) 12.51 0.59 v 0.41 
1999-2000 27.66 9.17 12.30 36.83 0.61 v 0.39 
2000-2001 27.10 8.07 8.98 35.17 0.59 v 0.41 
2001-2002 7.84 (3.29) 5.15 4.55 0.45 v 0.55 
2002-2003 (0.93) 6.27) 5.72 (7.20) 0.47 v 0.53 
2003-2004 1.06 17.69 (4.30) 18.76 0.51 v 0.49 
2004-2005 (0.34) (19.02 8.12 (19.36) 0.52 v 0.48 
Average 7.77 1.60 0.03 6.17 

Std Dev 9.86 9.72 8.24 16.96 

t stat 0.79 0.16 0.00 0.36 

without 2000 and 2001 

Average 3.85 3.64 (2.10) 0.21 

Std Dev 4.05 9.35 7.24 10.69 

t stat 0.95 0.39 (0.29) 0.02 


Table 1.4: Results From VL100/S&P500 March Futures Spread on Various Buy/Sell Dates for 
the 7 Turn-of-the- Years 1998-1999 to 2004-2005. 


Difference Difference Difference Trade Gain Trade 
Dec 15 (—1) to Jan 15 to Dec 15 to Weights 

TOY to (—1) Jan 15 end Jan Jan 15 S&P vs. VL 
1998-1999 0.35 1.15 (11.26) 1.50 0.23 v 0.77 
1999-2000 2.29 5.26 2.76 7.55 0.22 v 0.78 
2000-2001 8.68 15.81 7.72 24.49 0.24 v 0.76 
2001-2002 6.00 (2.13) 4.62 3.87 0.29 v 0.71 
2002-2003 0.66 (0.11) 0.50 0.54 0.32 v 0.68 
2003-2004 1.45 7.16 (0.87) 8.61 0.36 v 0.64 
2004-2005 4.42 (11.20) 6.31 (6.78) 0.37 v 0.63 
Average 3.41 2.28 1.40 5.68 

Std Dev 3.10 8.41 6.36 9.73 

t stat 1.10 0.27 0.22 0.58 


area (see Patterson, 2010). Ziemba stayed out of these markets until the 2009-2010, 
2010-2011 and 2011-2012 TOYs with the Russell 2000 mini futures. Figure 1.6 
shows these three trades, the round dots denote the entry and the squares the 
exit in our test anomaly account run by the authors; see Figure 1.29. So the old 
Clark-Ziemba trade modified seems to still work. Of course, there are lots of imple- 
mentation issues. 
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Figure 1.6: Russell 2000 — S&P500 Spread With Our Entries (Dots) and Exits (Squares). 
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1.3 The January Barometer 


Historically, returns in January have been a valuable signal for the returns in the 
following eleven months that year. If stocks have positive returns in January, then 
it is likely that the market as a whole will rise in that year. Hirsch (1986), who first 
mentioned this in 1972, has called this the January barometer. In the yearly updated 
Stock Trader’s Almanac by Jeffrey A. Hirsch and Yale Hirsch (2011), they define it 
as the full year rather than the last eleven months. We look at this both ways, full 
year and last eleven months, for the US S&P500 in Figure 1.8. The supposition is 
that: 


If the market rises in January, then it will rise for the year as a whole; but if it falls 
in January, then there will be a decline or a flat market that year. 


Figure 1.7 updates Hensel and Ziemba (1995a) and Ziemba (1994) which had the 
results for the 54 years 1940-1993. There are 72 years in the total sample with an 
18 year update to the end of December 2011. 

For the 72 years, when the return in January was positive, the rest of the year 
was up 84.4% of the time. This compares with 69.4% of all the years that the whole 
year was up. 

When the return in January was negative which was 27 of the 72 years, the 
rest of the year was down 48.1% of the time. Thus even in years when January 
is down, the whole year is about equally likely to be up or down. This 48.1% is 
significantly less than the 72.2% of all the years that the rest of the year went up. 
Figure 1.7 also shows the full year return for the four cases with arithmetic and 
geometric mean returns. We conclude that the January barometer does add value 
and is useful in various ways. Negative Januarys like 2008 had good predictive value. 
But the measure is not infallible. For example, 2010 had positive 11 month and 12 
month returns despite a negative January. But as in other cases of negative January 
but positive 11 and 12 month returns, those returns are, on average, small. 

In the 18 year update (1994-2011), the results as seen in Figure 1.7 are similar 
with the January up ROY up 72.7% (8 of 11) of the time. 

Figure 1.8(a) shows the cumulative rest of year returns for positive January, 
negative January and buy and hold. Historical buy and hold beats positive January 
and has the highest final wealth with negative Januarys producing almost no gains 
at all. Buy and Hold had returns that were high except the 2007-2009 drop in the 
S&P500 led to the positive January dominating. Figure 1.8(b) has the full year 
results. 

The equations for Figures 1.9 and 1.10 are: 


ROY = 0.0591 +0.794Jan R? = 6.5% 
(3.41) (2.21) 

ROY, = 0.1298 —0.324Jany R? = 0.7% 
(4.42)  (—0.55) 

ROY _ = —0.0840 —1.936Jan_ R? = 7.2% 
(—1.36)  (—1.39) 
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11 month 12 month 
mean return mean return 
arithmetic arithmetric 
geometric geometric 
38 years (84.4%) 15.1% 19.6% 
ROY up 14.8% 19.2% 
45 years January 
up 
7years (15.6%) -9.4% -5.4% 
ROY down 9.6% -5.6% 
72 years total sample 
1940-2011 
14years (51.9%) 11.2% 6.3% 
ROY up 10.8% 5.9% 
27 years January 
down 
13years (48.1%) -13.9% -16.7% 
ROY down -14.4% -17.2% 
(a) Total Sample 
11 month 12 month 
mean return mean return 
arithmetic arithmetric 
geometric geometric 
8years (72.7%) 16.4% 19.7% 
ROY up 16.0% 19.3% 
11 years January 
up 
3 years (27.3%) -7.6% -4.8% 
ROY down -7.8% 5.0% 
18 years update 
1994-2011 
4years (57.1%) 21.9% 16.4% 
ROY up 21.4% 16.0% 
7 years January 
down 
3 years (42.9%) -20.6% -24.0% 
ROY down -21.5% 24.9% 


(b) Update 


Figure 1.7: January Barometer Results, 1940-2011 and 1994-2011. 


Bronson (2011) reminds us that the January barometer has had six false positives 
since 1940, where January was up but the rest of the year was negative. In 1947, 
there were enough dividends and January returns to overturn this loss for the whole 
year. So that leaves the following five January positive net return of the year negative 
returns (not including dividends) as in Table 1.5. 

There have also been 14 false negatives since 1940 to 2010 where January was 
negative but the rest of the year is positive. We differ from Bronson by simply saying 
that if January is negative, the rest of the year is noise. So Bronson argues that the 
January barometer has failed to signal the direction of the stock market 19 of the 
past 71 years up to 2010, some 27% of the time. Agreeing with us, the stock market 
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Figure 1.8: Positive and Negative January and B&H Cumulative Returns. S&P500 Index (Cash), 


1940-2011. 


was up 73% of the time (52 of 71 years). But Bronson argues that the barometer 
is getting less accurate recently. Indeed 12 of the 19 failures (60%) have occurred 
in the 32 years since 1978. Figure 1.11 shows Bronson’s graph relating the rest of 
the year percent change as a function of January’s percent change. His regression 
suggests that the rest of the year percent change equals 6% plus 80% of January’s 
percent change. Compare this with our regression above with a minuscule 0.5% R? 
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Figure 1.9: January Return (x-axis) vs. Rest of Year Return (y-axis). S&P500 Index (Cash), 
1940-2011. 
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Figure 1.10: January Return (x-axis) (Up and Down Cases) vs. Rest of Year Return (y-axis). 
S&P500 Index (Cash), 1940-2011. 


and a rest of the year return of 12.6% minus 0.27 times January’s return. His low 
7% R? exceeds ours. He concludes that January 2011’s gain of 2.3% yields a forecast 
of a rest of 2011 gain of 7.8%. This gain is quite close to those we hear from the 
TV forecasters. 

Hirsch and Hirsch (2011) and Ziemba (2010) discuss this first five day of January 
predictor with data from 1950-2010. The last 37 positive first five days were followed 
by full year gains 32 times (86.5%) and a mean gain of 14.0% for the 37 years. The 
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Table 1.5: Returns for Positive 
January, Negative Rest of Year, %. 


Year January ROY Year 


1946 7.0 —17.6 —11.9 
1947 2.4 —2.3 0.0 
1966 0.5 —13.5 —13.1 
1987 13.2 —9.9 2.0 
1994 3.3 —4.6 —1.5 
2001 3.5 —16.0 —13.0 


Source: Bronson (2011) 


Regression Analysis of January (x-axis) with Rest of Year (y-axis) 
Rest-of-year % change equals +6.0% plus 80% of January's % change witha very The Jan"11 gain of 2.3% suggest the rest of 2011 will 


i E 9; = 0; 2-79 i likely gain 7.8%, but the very high standard deviation of 
high standard deviation of +/-14.9%, r= +26% and r? = 7%, which are very low 14.9% suggests the odds are two-thirds that the restof 
m year will come in between -7% and +22%, and one-third 
The 2nd- thru 6th-degree polynomial best fits are also graphed (maximum r? = 32%), illustrating odds that it will be between -22% and +37%. 
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Figure 1.11: Regression Analysis of January Return With Rest of Year. 


Source: Bronson (2011) 


23 negative first five days had 12 positive (52.2%) years and 11 negative years. The 
full month January barometer has been an even better predictor except in 2009 so 
we have focused on that but the five days is important to look at. To conclude, the 
2008 signal was the strongest at —5.3%, the negative first five days were the worst 
ever, and the negative January led to the very devastating 2008 with a yearly loss 
of —38.5% for the S&P500. The first five days of 2012 was a positive period. 

The results we have found are supplemented with other studies as follows. 

Brown and Luo (2006) consider the performance of the January barometer 
(JanB) in the U.S. from 1941-2003 and find it has predictive ability. More recently, 
Stivers, Sun, and Sun (2009) find, using the simple spread approach, the power of 
the JanB in U.S. indices has declined since it was published in the early 1970s but 
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that it remains a useful market timing technique in the 1975-2006 period. Addi- 
tionally, Sturm (2009) shows the JanB is particularly powerful in the first year of 
the presidential cycle. 

Cooper, McConnell and Ovtchinnikov (2006) focus on the 1940-2006 period and 
consider the robustness of results using NYSE data dating back to 1825. They call 
the effect the “other January effect” but we prefer January barometer. In addition 
to testing the JanB with the full market index, Cooper et al. also find it has predic- 
tive value for both small and large stocks and value and growth stocks. The effect 
persists after adjustment for business cycle and macroeconomic variables, investor 
sentiment, and the presidential cycle. 

Cooper et al. found that over the previous 147 years, the spread between the 
11-month return following positive versus negative January’s was 7.76%, and other 
papers have reported spreads of 10%+. Though consensus exists around this con- 
clusion that January returns have a predictive power, the consensus dissipates at 
the crucial point: Can you profit from it? 


1.3.1 How to trade the January Barometer (JanB) 


The apparent predictive power of the JanB leads us to if and how investors should 
trade to profit by it. The following strategies have been analyzed as ways to use the 
JanB to outperform a passive buy-and-hold strategy (see also Figure 1.8): 


1. Standard JanB strategy: Stay out of the market in January and go long or short 
for the remainder of the year depending on if the January return is positive or 
negative. 

2. JE+JanB strategy: Go long in January based on the original January effect 
(because market returns in January are positive on average) and, for the remain- 
der of the year, follow the standard JanB strategy. 

3. JE + JanB T-bill strategy: Go long in January based on the original January 
effect. If the January return is positive, go long. If the January return is negative, 
invest in t-bills. 


Regarding the standard JanB strategy, Marshall et al. and Cooper et al. came to 
a similar conclusion that a passive buy-and-hold strategy beats the standard JanB 
strategy. This result is unsurprising given that this strategy calls for staying out of 
the market in January to wait for the market signal before investing from February 
through December. By skipping January, the investor misses the excess returns 
experienced in January as documented as the original January effect. 

Therefore the two subsequent strategy alternatives both assume the investor 
goes long in January in order to benefit from the January effect. 

The JE+JanB strategy also underperforms a simple buy-and-hold strategy. 
Cooper et al. report that, while it was much lower than the return after a positive 
January, the average yearly return after negative Januarys was also positive at 
5.71%, so a short strategy in these periods earns less than the ¢-bills or the buy- 
and-hold strategy. Cooper et al. also point out the substantial losses from tail risk in 
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very good years occurring after negative Januarys, when shorting the market would 
have been disastrous for the investor. 

Both Cooper et al. and Marshall et al. find the JE + JanB T-bill strategy to be 
the best of the three above alternatives, but they disagree on whether this strategy 
is superior to the buy-and-hold baseline. 

For example, during the period from 1940 to 2006, Cooper et al. report average 
annual buy-and-hold returns of 11.94% compared to a yearly average of 12.79% for 
the JE+ JanB T-bill strategy. Cooper et al. conclude that following this strategy is 
of value to investors based on the past data. 

Marshall et al. reach the opposite conclusion through the same data, finding 
that the JanB cannot be used by investors to outperform a passive long strategy. 
For the period from 1940 to 2007, they find average yearly buy-and-hold returns 
of 12.68% and JE + JanB T-bill strategy returns of 13.09%. Marshall et al. say 
the discrepancies between the two conclusions are based on dissimilar statistical 
significance and risk assessment calculations as well as differing opinions of the 
economic significance of < 0.05% difference in annual return, which excludes transac- 
tion costs and uses simple spreads that they find biased away from investors trading 
realities. 

The only way the JE+ JanB T-bill strategy differs from the buy-and-hold strat- 
egy is that, following Januarys with negative returns, the investor opts for T-bills 
instead of equity. The former strategy does not outperform the buy-and-hold strat- 
egy because, following Januarys with negative returns, average 11-month T-bill 
returns are only marginally larger than average 11-month equity returns. 


1.3.2 The international January Barometer 


Hensel and Ziemba (1995b), Easton and Pinder (2007), and Stivers, Sun, and Sun 
(2009) address the performance of the JanB in international markets. Hensel and 
Ziemba find similar results in Switzerland and Europe and global as the U.S. 
Namely, about 85% positive years and rest of years following positive Januarys 
and noise about 50-50 following negative Januarys. The mean returns have the 
same basic behavior, being more favorable for positive than for negative Januarys. 
So positive Januarys do seem to have positive predictive power. 

Bohl and Salm (2010) study the predictive power of stock market returns in 
January for the rest of the year for 19 countries. They find that the barometer works 
well in the U.S., as we know, and in Norway and Switzerland. But it did not predict 
well in the other 16 countries which included Japan, France, Spain and Germany. 

The data periods vary by country but are long, for example, Australia 1903-2007, 
Austria 1970-2007, Belgium 1951-2007, Canada 1936-2007, and France 1896-2007. 
In many cases it is the high sigma leading to too low t’s which caused the significant 
non-predictability. But in some cases, the signal is actually going the wrong way 
in their regression model. So care is needed in these various countries to use the 
barometer for added value. 
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1.4 Sell-in-May-and-go-away 


September and October have historically had low stock market returns with many 
crashes occurring in October. November to February have historically had higher 
than average returns; see Gultekin and Gultekin (1983), Keim and Ziemba (2000), 
and this book. This suggests the strategy to avoid the bad months and be in cash 
then and only be long the stock market in the good months. Sell-in-May-and-go- 
away, also called the Halloween effect, is one such strategy. Figures 1.12 and 1.13 
show this strategy using the rule sell on the first trading day in May and buy on 
the 6th trading day before the end of October, for the S&P500 and Russell 2000 
futures indices for the years 1993-2011, respectively. This rule did beat a buy and 
hold strategy. Appendix Tables A.2 and A.3 show the monthly returns, respectively, 
for those 18 years. Tables A.6 and A.7 have the SIM versus buy and hold data from 
Figures 1.12 and 1.13. 

For the S&P500 a buy and hold strategy turns $1 on February 4, 1993, into 
$1.91 on December 31, 2011; whereas, sell in May and move into cash, counting 
interest (Fed funds effective monthly rate for sell in May) and dividends for the buy 
and hold, had a final wealth of $4.03, some 111.6% higher. For the Russell 2000, the 
final wealths were $1.83 and $5.35, respectively, some 192.3% higher. Historically, 
as shown below in the Presidential effects discussion, the SIM rule is not as reliable 
in Presidential election years but it did work in 2008. There are many discussions 
of the SIM rule with differing data years and differing entry and exit rules. Some 
suggest scrapping the rule in 2012, an election year. But with a big stock run up 
based on cheap Fed money and huge problems in Europe and elsewhere, being out 
of the market, in cash, seemed wise. Indeed, when we went to press on Monday May 
28, 2012, the S&P500 was down over 5% in May. 
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Figure 1.12: S&P500 Futures SIM and B&H Cumulative Returns Comparison 1993-2011 (Entry 
at Close on 6*? Day Before End of October; Exit 18* Day of May). 
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Figure 1.13: Russell 2000 Futures Sell in May (SIM) and B&H Cumulative Returns Comparison 
1993-2011 (Entry at Close on 6t! Day Before End of October; Exit 15% Day of May). 


Bouman and Jacobsen (2002) confirm that the SIM effect holds in 36 of the 37 
countries Vacation timing is a potential cause of the effect, suggesting the timing 
of summer vacations may cause temporal variations in appetites for risk aversion. 
However, they find evidence of the effect in their subset of Southern hemisphere 
countries, which under their hypothesis would be expected to have a different sea- 
sonal pattern. 

Seasonal Affective Disorder (SAD) was studied in Kamstra, Kramer, and Levi 
(2003) and Garret, Kamstra and Kramer (2004). SAD is a disorder in which the 
shorter, relatively sunless days of fall and winter cause depression, which recent 
research links to an unwillingness to take risk. Kamstra (2003) concludes that the 
SAD explanation does not lead to a profitable trading strategy because the risk pre- 
mium varies with the seasonal effects. Like the vacation timing hypothesis, Doeswijk 
finds the SAD hypothesis insufficient because SAD is known to start as early as 
September so the historically high November returns cannot be explained. 

Doeswijk (2005) posits that, in the fourth quarter of each year, investors are 
overly optimistic about the upcoming year. This optimism leads to attractive initial 
returns followed by a renewed realism that readjusts expectations. Unlike the SAD 
hypothesis, which suggests a varying risk premium, the Optimism Cycle hypothesis 
reflects a constant risk premium with a varying perception of the economic outlook. 
To test this hypothesis, Doeswijk ran three analyses: 1) the global zero-investment 
seasonal sector-rotation strategy; 2) the seasonality of earnings growth revisions; 
and 3) the initial returns of IPOs as a proxy for investor optimism. 

If this hypothesis is correct, a winning investment strategy is going long in 
cyclical stocks and short in defensive stocks during November through April (winter) 
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and following the opposite strategy from May through October (summer). (These 
stock groups are chosen for their relative exposure to the general economy, with 
cyclical stocks having a high exposure and defensive stocks a low exposure.) To test 
this strategy, Doeswijk uses the MSCI World index of global stock returns from 
1970-2003 and tests the data as a whole, in two 17-year sub-periods, using several 
variations on timing of the winter period, and various sector definitions. The study 
runs regressions using monthly market capitalization-weighted price return indices 
and their monthly log returns. 

Doeswijk finds that, on average during the study period, winter returns are 
a significant 7.6% higher than summer returns and the strategy works in 65% of 
the years. On a monthly basis, average performance of the global zero-investment 
strategy is 0.56%, which is significant at the 1% confidence level. Using further 
regression analysis techniques, Doeswijk also isolates the market timing effects from 
the seasonality and finds that seasonality alone accounts for approximately half of 
the excess returns. 

Both analyses by Doeswijk support the Optimism Cycle hypothesis. Expected 
earnings growth rates follow a seasonal cycle and that these changes have an effect 
on stock performance. The third analysis uses initial IPO returns, which show 
a remarkable seasonality, as a proxy for investor confidence. Using this investor 
confidence proxy as an independent variable, the regression result for remaining 
excess return is not statistically significant, which supports the Optimism Cycle 
hypothesis. 

Along with the three supporting analyses, Doeswijk explains a qualitative argu- 
ment in favor of his Optimism Cycle hypothesis. He argues that, since this phe- 
nomenon is one based on an aspect of human psychology, it tricks investors into 
repeating the same biases every year. Importantly, this cycle of optimism and pes- 
simism is not generally accepted, which Doeswijk argues allows for investors who 
understand it to profit from it as a free lunch until it is more widely accepted and 
the arbitrage opportunity is absorbed into the market. 


1.4.1 Same month next year 


Heston and Sadka (2007) review predictability models for average stock returns 
based on past returns, such as short-term and long-term momentum and reversal 
effects. Several studies confirm various predictability patterns, but several potential 
explanations of these patterns exist, including data snooping, risk compensation, 
or behavioral theories. By expanding the analysis outside of the U.S., Heston et al. 
reduces the potential bias of data snooping and uncovers new information about the 
relative applicability of predictability models in different countries. The paper ana- 
lyzes twelve European countries plus Canada and Japan using a dataset of monthly 
returns from 1985 to 2006. 

Heston and Sadka analyze monthly stock returns using cross-sectional regres- 
sions. They begin their analysis by analyzing various lagging return variables 
to check for various existing temporal prediction patterns, such as short-term 
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momentum effects and medium-term reversal effects. They confirm that the 
momentum effect holds in their international sample, with the exception of medium- 
term momentum effects not being present in Japan. They also uncover a new pat- 
tern, a one-year lag with a reversal effect in the intervening months. Additionally, 
the study expands upon the positive return continuation of one-year lags by check- 
ing on lags of every 12 months for up to 120 months. They find that the positive 
returns are present in these longer term yearly lag periods, though some results 
in longer time frames were not statistically significant due to insufficiently large 
sample sizes in some cases. 

Heston and Sadka also analyze potential portfolio strategies to make use of 
this uncovered pattern. They use the decile spread methodology of Jegadeesh 
and Titman (1993) to investigate various time horizons and sorting methods and 
additionally analyze calendar effects, such as the January effect. They analyze if 
return patterns according to liquidity measures and by country and find that nei- 
ther explains the temporal pattern. They also consider if the temporal pattern 
is a result of common international risk factors, finding that different countries 
are correlated in the short term but over longer time horizons the correlation 
is, while still positive, notably weaker and sometimes statistically insignificant. 
They conclude that international diversification around this strategy is benefi- 
cial and reason that capital market segmentation may leave rewards for seasonal 
risks specific to different countries or that in different countries seasonal news 
may cause relatively predictable behavioral responses that are reflected in return 
movements. 


1.5 Holiday Effects 


There has been a very strong holiday effect in U.S. markets throughout the twentieth 
century. Ariel (1990), Zweig (1986), Lakonishok and Smidt (1988) have documented 
this. For example, for the ninety years from 1897 to 1986, Lakonishok and Smidt 
found that fully 51.5% of the non-dividend returns on the Dow Jones industrials 
were made on the approximately eight yearly preholidays. Ariel using data from 
1963-1982 found a very strong effect with the average preholiday having returns 
that were about 23 times an average day for large capitalized stocks measured by a 
value weighted index of all NYSE stocks. Small capitalized stocks (equally weighted 
NYSE stocks) had returns 14 times larger but since this period was one of extremely 
high small stock returns, the actual returns exceeded the large capitalized securities. 
Lakonishok and Smidt also found that preholidays were associated with higher mean 
returns on all days of the week compared to average returns those days. Investigation 
of the holiday effect in Japan by Ziemba (1991) yielded very similar results. Using 
daily data on the Nikkei stock average from May 1949 when the market opened 
up after World War II to 1988, he found that the typical preholiday had returns 
of about five times the average non preholiday trading day, namely 0.246% versus 
0.0489%. 
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Holiday effects in other countries are discussed by Ziemba (1994) and in the 
survey by Cervera and Keim (2000) and other papers in the Keim and Ziemba 
(2000) volume. 

Historically it was the preholiday that had the highest returns with the —3 day 
having the next highest returns, with the day after the holiday having negative 
returns. 

A regression to separate out the effects on trading days —3 to +2 around a 
holiday using 0,1 variables led to the daily return 


R = 0.0352 +0.0799Day_3 + 0.0222Day_2 +0.1894Day_1 — 0.0663Day +1 + 0.00114Day+2 
(3.745) (1.491) (0.424) (3.709) (—1.334) (0.023) 


Observe the high positive coefficient and t-value on the —1 day. The evidence we 
have to update this to 2010 is from the futures markets from 1993-2010 and it is that 
the effect seems to have moved to the —3 day before the holiday and is much weaker 
than in the past. The pre-holiday is marginally positive for both the S&P500 and 
Russell 2000. Labor day for the Russell 2000 is the most reliable with a mean gain of 
0.88% with a t = 4.81 and 15 of 16 positive. Presidents’ day was also reliable 82.4% 
of the time with a t = 2.11. Since none of the holidays were highly significantly 
positive for the S&P500 and, except for these two, the Russell 2000 results were 
marginal, we conclude that the holiday effect exists to some extent on the —3 day 
but has diminished greatly in the 1990s and 2000s. The mean gain for the S&P500 
was 0.19% (t = 1.74) and the Russell 2000 was 0.26% (t = 2.14). 

Table 1.6 documents the overall results for these two indices on days —3 to +2 
plus other days and all the days. 

Figure 1.14 shows the holiday average returns per day for the S&P500 on the 
days —3 to +2 from 1993-2010. This shows the strong —3 day. Figure 1.15 has this 
by holiday. Figures 1.16 and 1.17 have the results for the Russell 2000. 


Table 1.6: Futures Holiday Average Returns by Day, 1993-2011.* 


Pre H Pre H Pre H After H After H 

S&P500 All Days —3 —2 -1 1 2 Others 
Count 4784 151 151 151 151 151 3910 
St Dev 0.0126 0.0122 0.0099 0.0107 0.0128 0.0143 0.0127 
Average 0.02% 0.19% —0.03% 0.03% 0.06% 0.09% 0.01% 
z 1.1697 1.9549 —0.3156 0.3991 0.5748 0.7871 0.5517 
Positive 53.1% 57.0% 53.0% 51.0% 53.6% 57.6% 52.8% 
Russell 2000 

Count 4791 152 152 152 152 152 3911 
St Dev 0.0151 0.0153 0.0121 0.0133 0.0150 0.0186 0.0151 
Average 0.02% 0.31% 0.00% 0.06% 0.09% 0.09% 0.00% 
z 1.0982 2.4735 —0.0096 0.5802 0.7114 0.6176 0.0536 
Positive 53.2% 63.8% 53.3% 53.9% 50.0% 57.9% 52.7% 


*For some years Xmas 2 AH= New Year —3PH 
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Figure 1.14: S&P500 Futures Average Returns by Day (—3, —2, —1, 1,2) by Day 1993-2011. 


10% lente lee aecccnnesescccceeeeceecs qati- pr============ === === n_e 
0.8% E Os Una an ate pte eo force Panag a emmy ag megs | 
0.6% }---------------------- j 

OS 

0.2% 
0.0% 


0.2% E: 


-0.4% a a ee oe 
President's Day ' ! 
-0.6% | BG009 Friday | [ee ete EEE AP E PEE E A A ESEESE EE 


Memorial Day 
W independence Day 


-0.8% 
ElThanksgiving Day 
O Christmas Day ' ! ' ' 
NOS PEN SNIENE A na ha TN a a Oe od 


Figure 1.15: S&P500 Futures Holiday Average Returns by Holiday, 1993-2011. 


1.5.1 The sell on Rosh Hashanah and buy on Yom Kippur anomaly 


This Rosh Hashanah anomaly relates to the Jewish new year. In 2009, that was 
close of September 18 to open on September 28 for the trading days. The origin of 
this practice seems to be the belief of Jewish investors that they should liquidate 
their portfolios during the holiday so that their attentions could be fully focused on 
their worship; or more likely in today’s world, not trade. Could this really be true 
in 2009, 2010 and 2011? 
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Figure 1.16: Russell 2000 Futures Average Returns by Day (—3, —2, —1, 1, 2) by Day, 1993-2011. 
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Figure 1.17: Russell 2000 Futures Holiday Average Returns by Holiday, 1993-2011. 


Going back to 1915, the performance of the DJIA is —0.62% from the last close 
before Rosh Hashanah until eight days later with the last close before Yom Kippur. 
From then to December 31 averaged a respectable 1.99%, see TheStreet.com and 


the Kirk 


Report. 


What happened in 2009? The close on September 18 was 1068.30 and the close 
on September 25 was 1044.38, so —2.24%. So the anomaly worked once again. 
What happened in 2010? 
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Rosh Hashanah for Jewish Year 5771 occurred on sunset Wednesday, 
September 8, 2010—nightfall September 10, 2010 and Yom Kippur began on 
Friday, September 17, 2010. The close on September 8 was 1098.87 and the close 
on September 16 was 1124.66, for a gain of 2.35%. So the anomaly did not work in 
2010. The close on December 31, 2010 was 1257.64 or a gain of 11.82% from the 
close on September 16, 2010. 

Rosh Hashanah 2011 began on September 28 and the S&P500 cash close then 
was 1151.06. Yom Kippur 2011 began on October 7 with the S&P500 close at 
1164.97. The change was +1.2%. So the anomaly failed again. 

Returning to September—October. Figures 1.3ab and 1.4ab show the monthly 
effect in the S&P500 large cap and the Russell 2000 small cap future indices, 
respectively, from 1993-2011 and 2004-2011. Observe that over the longer hori- 
zon October is actually positive and September is slightly negative and there is no 
reliable monthly effect. As the historically strong months of January and February 
are negative, December is reliably positive though. 

More recently, from 2004-2011, October is massively negative except it was up 
10.5% in October 2011 and November, which historically has the strongest TOM is 
also negative. The year by year September—October returns from 1993 to 2010 are 
in Table 1.7. The pattern is clear: these months are like other months except that 
they frequently have big declines —8.38% (2001), 11.31% (2002) and —9.79% (2008) 
for September and the —20.11% (2008) for October for the S&P500. October had 
other great falls in 1929, 1987 and other years. However, over the years 1998-2010 
both September and October were positive on average. The Russell 2000 is similar. 
Additional data is in the Appendix. 


1.5.2 Ramadan 


Bialkowski, Etebari, and Wisniewski (2009) study stock returns during the Muslim 
holy period of Ramadan in 14 predominantly Muslim country during the period 
from 1989-2007. They find that stock returns during Ramadan are almost nine 
times higher (38.1% versus 4.3%) than during the rest of the year and that this 
conclusion persists after controlling for other known calendar anomalies like the 


Table 1.7: S&P500 Futures Average Returns, September and October, 1993-2010. 


1993 1994 1995 1996 1997 1998 1999 2000 2001 
Sep —0.87%  —2.17% 4.48% 6.09% 5.51% 6.86% 1.79% 4.58% 8.38 
Oct 1.87% 2.59% —0.77% 2.59% —3.78% 7.38% 5.73% —1.15% 1.46% 


2002 2003 2004 2005 2006 2007 2008 2009 2010 
Sep  -11.31% —1.43% 0.94% 1.03% 3.02% 4.07% —9.79% 3.16% 8.31% 
Oct 7.96% 5.51% 1.32% —2.08% 2.79% 1.00% —20.11% —2.07% 3.74% 


Average StDev t 
Sep 0.17% 0.058 0.128 
Oct 0.78% 0.061 0.536 
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January effect, Halloween effect, and Turn-of-the-Week effect. Their explanations for 
this phenomenon rely on recent behavioral theories that connect investor emotions 
with their decisions. Specifically, they suggest that these excess returns are a result 
of increased investor optimism experienced during Ramadan as a time of relative 
happiness, solidarity, and social identity for Muslims; they go as far as to suggest 
that Ramadan may cause mild states of euphoria, as suggested by Knerr and Pearl 
(2008). This upbeat or positive sentiment then causes relative overconfidence and 
an increased willingness to take risk, such that investors perceive investments as 
of relatively higher value. Bialkowski et al. also consider that during this relatively 
healthy period there may be a higher demand for equities, as documented by Rosen 
and Wu (2004). They, however, do not find any evidence of a higher trading volume 
during Ramadan. 

In their study, Bialkowski et al. discuss Ramadan to shed light on its potential 
emotional effects, review the clinical effects of fasting, and review empirical evidence 
on the effects of Ramadan on equity prices in 14 Islamic countries. Their empirical 
results compare the average returns during the holy month and the rest of the 
year and find that 11 of 14 countries studied have higher returns during Ramadan; 
the countries that did not exhibit this anomaly are Bahrain, Saudi Arabia, and 
Indonesia. They mention the effects of relatively few observations in the case of 
Bahrain and Saudi Arabia and note the outlier of the Asian Crisis in Indonesia 
effecting its equity prices during Ramadan. 

They further test their results by two event studies, benchmarking returns 
against a constant-mean-return model as well as a predictive market model using 
a proxy of 23 industrialized countries that do not have Muslim majorities. In these 
tests, they calculated cumulative abnormal returns (CAR) as returns in excess of 
what an investor should expect in the absence of Ramadan, finding this CAR to fall 
between 2.5%-3.1% depending on the event study approach. Among other robust- 
ness tests, Bialkowski et al. analyze if this CAR may be compensation for increased 
risk during Ramadan by examining return volatility, but do not find any supporting 
evidence and in fact find that, except Turkey, all countries studied actually showed 
decreased volatility during Ramadan. 

Bialkowski et al. also test their results for illiquidity effects, exchange rate 
considerations, and other accepted calendar effects, and find that the Ramadan 
Effect remains anomalous and can most likely explained by temporal changes in 
investor psychology. They conclude that investors can profit in Muslim stock mar- 
kets by buying shares at the beginning of Ramadan and selling them at its end, 
though they do not explicitly model this strategy nor do they estimate transaction 
costs. 


1.6 Day of the Week Effects 


Historically in U.S. markets, there have been differences in the daily mean returns 
across the days of the week. A common finding is high Friday returns and low 
Monday returns. Early papers showing this are Cross (1973), French (1980), 
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Gibbons and Hess (1981), Gultekin and Gultekin (1983), Lakonishok and Levi 
(1982), and Rogalski (1984). Lakonishok and Smidt (1988) relate the weekend effect 
to the TOM effect, see that section of this chapter. Harris (1986) investigated time 
of day effects. Stocks advanced near the close in all days including on the negative 
Mondays and positive Fridays. The Monday negative returns accrued over the week- 
end for large cap stocks but during the Monday trading session for small cap stocks. 
Stocks tend to advance in the first 45 minutes of trading on all days except Monday 
where they fall. Wang, Li and Erickson (1997) using 1962-1993 data show that 
the Monday declines are from last two Mondays of the month and that there is no 
monthly effect in the first three weeks. Kamara (1997) showed that the weekly effects 
declined in 1962-1993 because of increased institutional trading in large cap stocks. 
But the small cap effect remains. Futures minus spot S&P returns are reversed as 
traders anticipate the effect. Chen and Sinal (2003) argue that short sellers close 
positions on Friday which increases prices and the new Monday shorts lead to Mon- 
day losses. Chan, Leung and Wang (2004) argue that the Monday decline effect is 
largely due to individual not institutional investors. The Monday decline is strongest 
in stocks with low institutional holdings. Moreover, the mean return on Monday is 
the same as on the other four days of the week for stocks with high institutional 
holdings. 

Table 1.8 from French (1980) gives the mean daily returns and other character- 
istics for the US S&P500 index for 1953-1977 and five year sub-periods. One sees 
in this 25 year period negative Mondays and positive Fridays on average. 

International evidence was provided by Dubois and Louvet (1966), Jaffe and 
Westerfield (1985ab), Jaffe, Westerfield and Ma (1989). Steeley (2001) argued that 
the weekend effect in the U.K. disappeared in the 1990s. Moreover, the day-of-the- 
week effects are explained by news arrivals. 

Kato (1990), Kato, Schwartz and Ziemba (1989), Ziemba (1993) also study 
Japanese returns. Kato (1990) found negative Tuesdays not Mondays and posi- 
tive Wednesday and Saturday returns. Ziemba (1993) investigated weeks ending 
on Friday with a full trading session in two parts with a break or on Saturday 
with only the first session. He found that in the weeks with Friday endings, 
Fridays were positive and Mondays negative. But in the weeks with Saturday 
trading, Saturdays were higher positive and the negative day was the Tuesday. 
Somehow the two days were needed for the fall. Choudhry (2000), using January 
2000 to June 2005 data from Indonesia, Malaysia, the Philippines, South Korea, 
Taiwan and Thailand showed the presence of the day of the week effects in 
both stock returns and volatility and a possible spillover from the Japanese stock 
market. 

Table 1.9 from Chukwuogor-Ndu (2006) shows the mean return from Euro- 
pean stock markets from January 1997 to December 2004. Friday is positive in all 
15 countries studied and Mondays are mixed, some positive, some negative. 
Tuesdays and Thursdays are also positive in most countries. Standard deviations 
are high so statistical significance is weak. 
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Table 1.8: Means, Standard Deviations, and T-Statistics of the Percent Return From the Close 
of the Previous Trading Day to the Close of the Day Indicated. 


Monday Tuesday Wednesday Thursday Friday 


1953-1977 Mean —0 1681 00157 0 0967 0 0448 0 0873 


Standard deviation 0 8427 0 7267 0 7483 0 6857 0 6600 
t-statistic —6 823° 0 746 4 534° 2 283> 4 599° 
observations 1170 1 193 1 231 1 221 1 209 
1953-1957 Mean —0 2256 —0 0096 0 1592 0 0553 01413 
Standard deviation 0 8998 0 7498 0 7141 0 6751 06222 
t-statistic —3 851° —0 197 3 497° 1 287 3 533° 
observations 236 238 246 247 242 
1958-1962 Mean —0 1691 0 0537 0 0777 0 0652 0 1131 
Standard deviation 0 8512 0 7223 0 6503 0 6347 0 6097 
t-statistic —3 045° 1 149 1 885» 1 624 2 892° 
observations 235 239 249 250 243 
1963-1967 Mean —0 1389 0 0385 0 1008 0 0517 0 1015 
Standard deviation 0 5820 0 4991 0 5515 0 4933 0 4386 
t-statistic —3 650° 1 193 2 884° 1 660° 3 600° 
observations 234 238 249 251 242 
1968-1972 Mean —0 1673 —0 0058 0 1465 0 0003 0 1034 
Standard deviation 0 7769 0 6233 0 7425 0 6516 0 5898 
t-statistic —3 266° —0 144 3 005° 0 007» 2 705° 
observations 230 239 232 225 238 
1968-1972 Mean —0 1393 —0 0016 0 0057 0 0470 —0 0219 
Standard deviation 1 0379 0 9609 0 9968 0 9102 0 9304 
t-statistic —2 058» 0 026 0 091 0 813 —3 368 
observations 235 239 255 248 244 


aReturns for periods including a holiday are omitted. These returns are defined as Ry = 
ln(P;/P;—1)100. 

b5% significance level 

c0 5% significance level 

Source: French (1980) 


French (1980) using data from 1953 to 1977 on the S&P composite found that 
Friday returns had a positive mean as did Tuesday to Thursday but Monday had 
negative returns. This refuted two hypothesis: 


1. namely that returns occur continuously over time so Monday should have three 
times the average day mean time; and 

2. that all days have the same mean return since returns are generated during 
trading time which are the same for all days. 


Keim and Stambaugh (1984) extend the research back to 1928 that Mondays have 
negative mean returns for exchange traded stocks of all firm sizes and for OTC 
stocks. 

They also found that Friday to Monday correlations were higher than the other 
days. Abraham and Ikenberry (1994) find the same effect. When Friday is negative, 
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Table 1.9: Average Daily Returns of the EFM for the Period January 2, 
1997 — December 31, 2004. 


Country Mon Tue Wed Thur Fri 


Austria 0.0299 0.0233 0.0118  —0.0157 0.0037 
Belgium 0.0036 —0.3000 0.0011 0.3270 0.0134 
Czech Republic 0.0059 0.0154 0.0229 0.0242 0.0152 
Denmark 0.0057 0.0185 0.0213 0.0153 0.0240 
France —0.0120 0.0178 —0.0094 0.0278 0.0139 
Germany 0.0395 0.0253 —0.0196 0.0104 0.0125 
Italy —0.0137 0.0360 —0.0286 0.0280 0.0384 
Netherlands 0.0540 0.0179 -—0.0194 —0.0010 0.0252 
Russia 0.0428 0.1280 —0.0903 0.0721 0.1125 
Slovakia —0.1101 0.0371 0.0480 0.0313 0.0035 
Spain —0.0303 —0.7170 —0.0532 0.7840 0.0220 
Sweden 0.0551 0.0138 —0.0564 0.0177 0.0407 
Turkey —0.1730  —0.0329 0.0122 0.2069 0.2396 
Switzerland 0.0070 0.0132 0.0121 0.0245 0.0357 


United Kingdom —0.0181 0.0200 0.0095 0.0141 0.0313 
Source: Chukwuogor-Ndu (2006) 


Monday is negative 80% of the time with a —0.61% mean return. But when Friday 
was positive Monday is positive on average returning +0.11%. The effect like most 
anomalies is strongest in small and mid-cap companies. 

Lakonishok and Maberly (1990) show that there was an increase in individual 
as opposed to institutional trading who sell more than buy on Mondays. Connolly 
(1989) showed that the strength of the day of the week finding depend crucially on 
the estimation and testing methods used. Sullivan, Timmermann and White (2001) 
provide a strong econometric argument regarding the care needed to get proper 
statistical conclusions. Using 100 years of daily data, they find anomalies like the 
day-of-the-week and weekend effects are significant but if you consider data mining, 
the anomalies are not statistically significant over the whole universe of data. 

Summary: the cash evidence is strong that there was a weekend effect with 
positive Fridays and negative Mondays. But the strength of the effect has diminished 
and it is hard to prove that it is not data snooping and out of sample. Moreover, the 
effect seems to reverse in the futures markets. The reasons for the effect are many 
and varied. The question of can you implement strategies based on these results 
remains. Arsad and Coutts (1997) argue that liquidity and other transactions costs 
might limit this possibility. We now look at the recent data in the futures markets. 

To update, we compute the daily returns in the S&P500 and Russell 2000 futures 
from 1993-2011. Recall that with anticipation the futures might reverse the cash 
effect. Figure 1.18ab and Tables 1.10 give the results. In the futures markets for the 
S&P500, all days was positive with Thursday essentially zero Monday has higher 
average returns than all the days. All the days have gains slightly more than 50% of 
the time. The Russell 2000 is very different with higher returns on all the days except 
for Mondays which were negative. Still all days were positive slightly more than 50% 
of the time. Finally, Table 1.11 and Figures 1.19ab show the results by month. 
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Figure 1.18: Futures Daily Returns as Function of Day of the Week, 1993-2011. 


1.7 Option Expiry Effects in the Russell 2000 and S&P500 
Futures Markets 


Options expiry tends to be a positive period in the third week of the month; see 
Figure 1.20 and Table 1.12 for specific results. As usual, the fluctuations are stronger 
for the small cap Russell 2000 with a quite strong —3 day with a mean return of 
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Table 1.10: Futures Daily Returns as Function of Day of the Week, 


1993-2011. 

Mon Tue Wed Thu Fri All 
S&P500 
Count 917 979 978 962 956 4792 
Mean 0.0139 0.0130 0.0120 0.0124 0.0114 0.0126 
StDev 0.03% 0.06% 0.00% 0.00% 0.01% 0.02% 
t-stat 0.5823 1.4434 0.0776 0.1000 0.3509 1.1771 
Positive 54.7% 51.1% 54.1% 53.5% 52.0% 53.1% 
Russell 2000 
Count 923 978 978 967 957 4803 
Mean 0.0163 0.0153 0.0149 0.0151 0.0135 0.0150 
StDev —0.05% 0.07% 0.04% 0.02% 0.04% 0.02% 
t-stat —0.8561 1.4637 0.7715 0.3519 0.8436 1.1043 


Positive 51.2% 52.4% 55.2% 52.0% 54.6% 53.1% 


Table 1.11: Futures Daily Returns as Function of Day 
of the Week by Month, 1993-2011. 


Mon Tue Wed Thu Fri 


S&P500 

January 0.18 —0.12 0.13 —0.10 —0.14 
February 0.11 0.13 0.05 0.11 0.16 
March 0.01 0.10 0.03 0.17 0.08 
April —0.06 0.30 0.00 0.31 —0.15 
May 0.24 0.05 0.03 —0.14 —0.02 
June —0.09 —0.04 0.04 0.12 —0.04 
July 0.00 0.00 0.09 0.00 —0.07 
August —0.01 —0.14 0.20 —0.16 —0.06 
September —0.14 0.15 —0.11 —0.14 0.27 
October 0.14 0.17 —0.31 0.08 0.14 
November 0.08 0.03 —0.01 0.06 0.12 
December 0.05 0.18 0.13 —0.13 0.18 
Russell 2000 

January 0.08 —0.10 0.12 —0.06 —0.25 
February 0.10 —0.11 0.02 —0.10 —0.08 
March —0.11 0.10 0.07 0.27 —0.01 
April —0.34 0.32 0.13 0.42 —0.12 
May 0.26 0.15 —0.12 —0.07 0.05 
June —0.19 —0.06 0.15 0.16 0.10 
July —0.10 0.00 0.01 —0.15 —0.06 
August —0.08 —0.14 0.33 —0.15 0.01 
September —0.10 0.27 —0.10 —0.10 0.23 
October 0.09 —0.14 —0.30 0.04 0.15 
November 0.08 0.00 0.07 —0.03 0.08 


December —0.01 0.33 0.27 —0.08 0.38 


+0.61% on that day with 67.1% positive and a t-value of 3.62. The other days are 
basically noise. The S&P500 also has on average a gain of 0.40% on the —3 day and 
noise the other days around the options expiry. The ¢ for this day is 2.9 with 60.5% 
positive days. 
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Figure 1.19: Futures Daily Returns as Function of Day of the Week by Month, 1993-2011. 
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Figure 1.20: Futures Average Returns by Day Around Options Expirations, 1993-2011 (N = 1 
Options Expirations Day). 


1.8 Seasonality Calendars 


Ziemba and Schwartz (1991) made seasonality calendars for the first section of the 
Tokyo Stock Exchange (TSE) for the years 1988-1992. The days were ranked from 
—4, the worst, to +4, the best. A regression predicts the expected daily change of 
the index based on all the seasonal effects. So each trading day has both a ranking 
and an expected performance. Of course, this ranking and performance does not 
consider the current economic environment and recent news so it could be off. But 
it gives one an idea of what to expect and what to look for. 
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Table 1.12: Futures Average Returns by Day Around Options Expi- 
rations, 1993-2011. 


3 2 1 1 2 Others 
S&P500 
Count 76 76 76 76 76 4331 
St Dev 0.0121 0.0111 0.0112 0.0095 0.0127 0.0126 
Average 0.40% —0.03% 0.17% 0.01% —0.05% 0.01% 
Z 2.9003 —0.2086 1.3313 0.0551 —0.3184 0.7142 
Positive 60.5% 55.38% 63.2% 50.0% 46.1% 52.7% 
Russell 2000 
Count 76 76 76 76 76 4347 
St Dev 0.0148 0.0118 0.0126 0.0106 0.0166 0.0151 
Average 0.61% —0.05% 0.23% -—0.01% —0.15% 0.02% 
Z 3.6224 —0.3873 1.6119 —0.1234 —0.7912 0.7452 


Positive 67.1% 52.6% 56.6% 50.0% 39.5% 53.1% 


Canestrelli and Ziemba (2000) investigated seasonal anomalies in the Italian 
stock market from 1973-1993. The results show that the effects have been found in 
the U.S., Japan and other markets such as the weekend, turn-of-the-year, monthly, 
holiday and January barometer were present in Italy during that period. The data 
used were 7668 days from January 3, 1973 to December 31, 1993 of which 5238 
were trading days. The highest daily return was +8.03% and the lowest —10.02%. 
The return distribution was fatter in the tails than Gaussian normal and there was 
autocorrelation in these returns. They did a careful analysis of these anomalies and 
readers may refer to their paper in Keim and Ziemba (2000). They also ranked the 
days into seasonality calendars using the following regression: 


Rt= a1D Monday + a2 DTuesday + a3D Friday + a4DistDay T as D Easter 


+ as DChristmas = a7DistNov + agD Jan + ag D Day30 Te a10D pay31 + ut 


Monday Tuesday Friday 1st Day Easter Xmas Ist Nov Jan Day 30 Day 31 
Mean -0.2221 -0.2082 0.0705 1.2667 0.6832 0.3048 0.6672 0.1419 0.3150 0.2386 
(St Dev) (0.0442) (0.0431) (0.0439) (0.0809 (0.2945) (0.29801) (0.2820) (0.0653) (0.1003) (0.1355) 
t-stat -5.03** -4.83** 1.61 15.66** 2.32* 1.09 2.37* 2.17* 3.14** 1.76 


Cost = 0.00R2 = 0.0582 R2aaj = 0.0566D.W. = 1.58F = 32.44«*ReferringtoHo : a1 = a2 = a3 = =aio=0. 


* indicates 5% confidence level ** indicates 1% confidence level 


The results indicate that there was the following ranking of the anomalies by impor- 
tance of their effect on daily stock returns: 


First day of the monthly account 
Monday 

Tuesday 

Day 30 (calendar, not trading) 
15t November 

Easter 

January 


SVs oh te he 
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January 2010 Calendar and Returns 


Sunday Monday Tuesday Wednesday Thursday Friday Saturday 
Legend Code 1 New Year's Day 2 
RL code RL day score|TOM = —Turn-of-the-month 
RL realr RL expected r/RL = Russell 2000 future 
SP code SP day score|SP = S&P500 future 
SP realr SP expected r| 
3 4 5 6 th 8 9) 
Tom 0 0 0 0 o| 
2.15% 0.15%| -0.20% 0.15%| -0.02% -0.12%| 0.50% -0.12%| 0.64% -0.12% 
irom 1 1 o o 0 
1.62% 0.12%] 0.31% 0.12%) 0.07% -0.10%| 0.40% -0.10%| 0.35% -0.10% 
10 11 12 13 14 15 16 
o 0 o o o 
-0.50% -0.12%| -0.67% -0.12%| 0.82% -0.12%| 0.76% -0.12%| -1.42% -0.12% 
o 0 o o 0 
0.09% -0.10%] -0.74% -0,10%| 0.66% -0.10%] 0.33% -0.10%] -1.14% -0.10% 
17 18 ML King Day 19 20 21 22 23 
0 o o 0 
1.54% -0.12%| -1.30% -0.12%] -1.96% -0.12%| -1.47% -0.12% 
0 o o o 
1.19% -0.10%| -1.03% -0.10%] -2.03% -0.10%| -1.80% -0.10% 
24 25 26 27 28 29 30 
irom ifrom ifrom 1from ifrom 1 
-0.08% 0.17%] -0.81% 0.17%] 1.00% 0.17%] -1.85% 0.17%] -0.74% 0.17% 
[TOM 1}Tom 1|Tom 1}Tom 1jtom 1 
0.14% 0.12%] -0.48% 0.12%] 0.67% 0.12%| -1.39% 0.12%] -0.81% 0.12% 
31 
Jan 2010 RL return -3.79% 
Jan 2010 SP return -3.72% 


Figure 1.21: January 2010 Calendar and Results. 


8. Day 31 (calendar, not trading) 
9. Friday 
10. Christmas 


Dzhabarov and Ziemba made similar calendars for the S&P500 and Russell 2000 for 
2010. Figure 1.21 shows the seasonality calendar and results for the S&P500 and 
Russell 2000 futures for January 2010. The results of the exercise with January’s 
S&P500 = —3.72% and Russell 2000 = —3.79% are in Figure 1.21. When the authors 
created the calendar in 2009 they chose to use the coefficients from 2004—2009 rather 
than from 1993-2009 to employ the most current results. This way also offered better 
goodness of fit (higher R-squared). 

Each box for a given trading day shows the model expected returns on the 
right and real returns added after January 2010 on the left for each of two index 
futures plus a ranking from —4 the worst to —3, —2, —1,0, +1, +2, +3 and +4 the 
best (Russell 2000 first, S&P500 second). Obviously, economic news may dominate 
a particular day, but the calendars add value. To compute the calendars, regres- 
sions were run on all the days around an anomaly concept and then a streamlined 
equation with only the best predictors was used to construct the expected values 
and calendar weightings. The authors also used turn-of-the-year variable which con- 
sistently generated positive returns though not being statistically significant. The 
variables that are working: for Russell 2000 3rd day before holiday (positive), TOM 
good days (positive), turn-of-the-year good days (positive), 3° day before options 
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Table 1.13: Number of Days Ranked in Each Category for Each 
Month for the S&P500 and Russell 2000 in 2010. 


Russell 2000 S&P500 
Day Scores 0 1 2 3 4 0 1 2 3 4 
January 14 5 12 7 
February 11 7 1 11 7 1 
March 15 6 1 1 15 6 2 
April 15 6 15 6 
May 14 5 1 14 5 1 
June 14 6 1 1 14 6 2 
July 14 7 14 7 
August 15 7 15 7 
September 13 6 1 1 13 6 2 
October 14 7 14 7 
November 14 6 1 14 6 1 
December 17 3 1 1 12 7 1 2 
Total: 170 71 2 5 4 163 77 3 0 9 


expiry (positive) and for S&P500 3°¢ day before holiday (positive), turn-of-the- 
month good days (positive), turn-of-the-year good days (positive), 3°¢ day before 
options expiry (positive). 

How did the calendars do in 2010? We ran a test starting with initial wealth of 
$100 at the beginning of January. There was a $4 bet if the day’s rating was +4, 
$3 for +3, etc down to —$3 for —3 and —$4 for —4. Table 1.13 shows the number 
of trading days by rank for each month for the S&P500 and Russell 2000. Most of 
the days are ranked 0 or +1 with a few +2, 3, or 4. The lower two panels of this 
table are the number of days for each index that were actually positive and the 
percent positive. Figure 1.22 shows the graphs of the trading results and Table 1.14 
the monthly actual returns. Basically the calendars seem to beat the index when 
the index has low returns but underperform in strong market periods. 


1.9 Political Effects? 


1.9.1 When Congress is in session 


Ferguson and Witte (2006) find a strong correlation between Congressional activity 
and stock market returns such that returns are higher and volatility lower when 
Congress is in session. They use four data sets, including the Dow Jones Industrial 
Average since 1897, the S&P500 index since 1957, and the CRSP value-weighted 
index and CRSP equal-weighted index since 1962. They compare mean daily stock 
returns and annualized returns when the U.S. Congress is in and out of session. 
Depending on the index tested, statistically significant differences in average daily 
returns range from 4—11 basis points per day. Annualized stock returns are 3.3-6.5% 


3This is an updated and modified version of discussion in Ziemba (2012) 
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Figure 1.22: 2010 Calendar and Returns Trading. 


Table 1.14: 2010 Returns From Russell 2000 and S&P500 Buy 
and Hold Versus Calendar Returns. 


Buy & Hold Calendar 

Russell 2000 S&P500 Russell 2000 S&P500 
January —3.79% —3.72% —2.43% —1.68% 
February 4.35% 2.98% 5.03% 4.56% 
March 7.74% 5.57% 4.45% 2.68% 
April 5.50% 1.47% 0.20% 0.11% 
May —8.25% —8.41% 5.56% 3.21% 
June —8.54% —5.94% —2.84% —2.81% 
July 6.48% 6.80% —5.92% —1.72% 
August —7.67% —4.69% 1.45% 1.72% 
September 11.85% 8.31% 10.39% 6.28% 
October 4.00% 3.74% 1.02% 1.30% 
November 3.31% —0.11% —1.42% —3.26% 
December 7.62% 6.18% 2.75% 2.55% 


higher when Congress is out of session, and between 65-90% of capital gains have 
occurred when Congress is not in session (which is notably greater than the pro- 
portionate number of days Congress is not in session). 

Ferguson and Witte also test these results in several ways. First, they analyze 
if the Congressional effect is just a proxy for other known calendar effects, such as 
the Day-of-the-Week effect, January effect, and Pre-Holiday effect. They conclude 
that, after controlling for these anomalies, there is still a Congressional effect of 
3-6 basis points per day, which means that no more than half of the Congressional 
effect is captured by controlling for other known anomalies. The study also tests for 
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robustness and finds there is a low probability that these results are the effect of a 
spurious statistical relationship. 

Next they test if public opinion toward Congress accounts for the Congressional 
effect by using public polling data as a proxy for general investors attitudes toward 
Congress. They use 162 polls from 1939 to 2004, though 112 of these were conducted 
after 1989. They find that an active Congress does not itself lead to poor stock 
returns but rather that the publics opinion of that active Congress accounts for 
the depressed returns. They also find that each index exhibits volatility that is 
significantly lower when Congress is not in session and that this is also driven by 
public opinion. 

Then Ferguson and Witte test the implications of this predictive capability on 
optimal investor asset allocation using the models of Kandel and Stambaugh (1996) 
and Britten-Jones (1999); they find that trading on the Congressional effect would 
allow investors to better allocate between equities and cash and to achieve a higher 
Sharpe ratio. 

Ferguson et al. consider three alternatives as possible explanations of the Con- 
gressional Effect, concluding that their findings may be explained by viewing public 
opinion of Congress as a proxy for investors moods, regulatory uncertainty, or rent- 
seeking. The mood-based hypothesis follows other studies in behavioral finance that 
suggest depressed investors are relatively risk averse, which in this case would imply 
that negative public opinion of Congress was depressing investors and dampening 
returns. The regulatory uncertainty hypothesis follows from the implication that 
there is more uncertainty in the market when Congress is in session, such that risk 
and therefore returns are higher. The rent-seeking hypothesis is based on Rajan 
and Zingales (2003) and suggests that concentrated economic interests limit the 
efficiency of markets such that they are less efficient and biased toward powerful 
financial players when Congress is in session. 


1.9.2 Election cycles 


Herbst and Slinkman (1984), using data from 1926-1977, found a 48-month 
political/economic cycle during which returns were higher than average; this cycle 
peaked in November of presidential election years. Riley and Luksetich (1980) and 
Hobbs and Riley (1984) showed that, from 1900-1980, positive short-term effects 
followed Republican victories and negative returns followed Democratic wins. Huang 
(1985), using data from 1832-1979 and for various subperiods, found higher stock 
returns in the last two years of political terms than in the first two. This finding 
is consistent with the hypothesis that political reelection campaigns create poli- 
cies that stimulate the economy and are positive for stock returns. These studies 
concerned large cap stocks. 

Hensel and Ziemba (1995c, 2000) investigated several questions concerning US 
stock, bond and cash returns from 1928 to 1997. They asked: Do small and large 
capitalization stock returns differ between Democratic and Republican adminis- 
trations? Do corporate bond, intermediate and long-term government bonds and 
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Treasury bill returns differ between the two administrations? Do the returns of var- 
ious assets in the second half of each four-year administration differ from those in 
the first half? Were Clinton’s administrations analogous to past Democratic admin- 
istrations? We also discuss here the terms of George W. Bush and Barack Obama 
to update to the end of 2011. 

Their results indicate a significant small cap effect during Democratic presiden- 
cies. Small cap stocks (the bottom 20% by capitalization) had higher returns dur- 
ing Democratic than Republican administrations. There has also been a small cap 
minus large cap S&P advantage outside the month of January for the Democrats. 
The higher returns with Democrats for small cap stocks are the result of gains 
rather than losses in the April-December period. The TOY small firm effect, in 
which small cap stock returns significantly exceed those for large cap stocks in 
January, under both Republican and Democratic administrations, occurred during 
these 70 years. This advantage was slightly higher for Democrats, but the difference 
is not significant. Large cap stocks had statistically identical returns under both 
administrations. For both Democratic and Republican administrations, small and 
large cap stock returns were significantly higher during the last two years of the 
presidential term than during the first two years. Moreover, bond and cash returns 
were significantly higher during Republican compared with Democratic administra- 
tions. The results also confirm and extend previous findings that equity returns have 
been higher in the second half compared with the first half of presidential terms. 
This finding is documented for small and large cap stocks during both Democratic 
and Republican administrations. Finally, two simple investment strategies based 
on these findings yielded superior portfolio performance compared with common 
alternatives during the sample period. The results cast doubt on the long run wis- 
dom of the common 60/40 stock-bond strategy since all 100% equity strategies 
investigated had much higher wealth at the end of the sample period. Indeed the 
1942-1997 returns were twenty-four times higher with the strategy small caps with 
Democrats and large caps with Republicans than the 60/40 mix and the updated 
1998-2010 returns shown in Table 1.20 show similar outperformance. 

Table 1.15 shows that both small and large cap stocks had lower mean returns in 
the 13 months following an election. Figure 1.24 shows the specific months following 
the election for large (S&P500) and small cap (bottom 20%) stocks. 


Table 1.15: Annual Average Equity Returns for Presidential Election Months 
and the Subsequent 13 Months, 1928-1997, Minus Annualized Monthly 
Averages.* 


1928-1997 1998-2010 1928-2010 
Return Period Large Small Large Small Large Small 
Election + Next 13 Months 8.12 6.51 4.08 12.20 7.54 7.33 
Annual Average 10.12 12.02 5.19 8.22 9.34 11.42 


Annual Difference —2.00 —5.51 —1.11 4.00 —1.79 —4.09 


*Monthly means were annualized by multiplying by 12 
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The 1928-1997 period encompassed 18 presidential elections with an update to 
2010 and three more elections. The end of 1997 included the first year of Clinton’s 
second term. There were 33 years of Republican and 37 years of Democratic admin- 
istrations during this period. The update to the end of 2010 covers the last three 
years of Clinton’s second term plus two George W. Bush terms plus the first two 
years of Barack Obama’s administration, namely 1998 to 2010, a period where small 
cap stocks outperformed large cap stocks. Tables 1.16 and 1.17 list and compare 
the first year, first two year, last two year and whole term mean returns under 
Democratic and Republican administrations from January 1929 to December 1997 
and for January 1937 to December 1997, a period that excludes one term for each 
party during the 1929 crash, subsequent depression and recovery period plus the 
update to 2010. Each term is considered separately, so two-term presidents have 
double entries. The t-values shown in Table 1.16 test the hypothesis that, during 
the 1928-1997 period, returns did not differ between Democratic and Republican 
administrations. 

From 1929 to 1997, the mean returns for small stocks were statistically higher 
during the Democratic presidential terms than during the Republican terms. The 
data confirm the advantage of small cap over large cap stocks under Demo- 
cratic administrations. Small cap stocks returned, on average, 20.15% a year under 
Democrats compared with 1.94% under Republicans for the 1929-1997 period. This 
difference, 18.21%, was highly significant. The first year return differences for this 
period were even higher, averaging 33.51%. 

The right hand panel of Table 1.16 presents the return results after eliminating 
the 1929 crash, the Depression and the subsequent period of stock price volatil- 
ity. Removing these eight years (1929-1936) from the study eliminates one Demo- 
cratic and one Republican administration from the data. The small stock advantage 
under Democrats was still large (an average of 7.55% per four-year-term) but was 
no longer statistically significant. The large cap (S&P500) returns during Demo- 
cratic rule were statistically indistinguishable from the returns under Republican 
administrations. Table 1.17 has the update to 2010. 

For Democratic and Republican administrations, the mean small and large cap 
stock returns were much higher in the last two years compared with the first two 
years of presidential terms for both of the time periods presented in Table 1.16. For 
example, small cap stocks returned 24.65% during the last two years compared with 
15.90% during the first two years for Democrats and 10.18% compared with —6.29% 
for Republicans from 1929 to 1992. Returns on large cap stocks increased to 17.40 
from 8.09% for Democrats and to 9.06% from 3.77% for Republicans for the same 
period. This result is consistent with the hypothesis that incumbents embark on 
favorable economic policies in the last two years of their administrations to increase 
their reelection chances and that the financial markets view these policies favorably. 

The advantage of small stocks over large stocks under Democratic administra- 
tions was not a manifestation of the January small stock effect. Instead, Tables 1.23 
and 1.18 and Figure 1.23 show the relative advantage of small over large cap stocks 
under Democrats compared with that under Republicans was attributable to having 
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Table 1.16: Average Annual Returns for the First and Second Years and Four Years of Demo- 
cratic and Republican Presidencies* to 1997. 


January 1937 to December 1997 January 1929 to December 1997 
S&P500 TR US Small Stk TR S&P500 TR US Small Stk TR 


Democrat 
Avg 1st Yr 6.58 11.32 10.24 19.06 
Avg 1st 2Yrs 6.14 11.85 8.09 15.90 
Avg Last 2Yrs 16.13 24.11 17.40 24.65 
Avg. Term 10.81 16.71 12.62 20.15 
StdDev Term 16.35 27.76 18.26 30.69 
# Years 36.00 36.00 37.00 37.00 
Republican 
Avg 1st Yr 1.87 —6.22 0.54 —14.45 
Avg 1st 2Yrs 6.98 1.39 3.77 —6.29 
Avg Last 2Yrs 15.03 16.95 9.06 10.18 
Avg. Term 11.00 9.17 6.42 1.94 
StdDev Term 15.12 19.89 21.17 27.81 
# Years 28.0 28.0 32.0 32.0 
Diff Ist Yr 4.72 17.54 9.71 33.51 
Diff 1st 2Yrs —0.84 10.46 4.32 22.19 
Diff Last 2Yrs 1.10 7.16 8.33 14.47 
Diff Term —0.19 7.55 6.20 18.21 
lst year t-values 0.67 1.39 1.15 2.58 
(Ho:Diff=0) 
First 2-years t-values —0.14 1.13 0.69 2.39 
(Ho:Diff=0) 
Last 2-years t-values 0.20 0.69 1.20 1.41 
(Ho:Diff=0) 
Term t-values —0.05 1.04 1.29 2.57 
(Ho:Diff=0) 


*In this and subsequent tables, statistically significant differences at the 5% level (2-tail) are 
shown in bold. 
Source: Hensel and Ziemba (2000) 


fewer small stock losses, as well as higher mean small stock returns, in the April- 
December period. Under Democrats, the mean returns were positive in each of these 
months, except October, and the small minus large differential was positive during 
10 of the 12 months; under Republicans, the small minus large differential was 
negative during 9 of the 12 months. 

The small cap advantage also occurred in the months following Democrat 
Clinton’s first election. From November 1992 to December 1993 the small cap index 
rose 36.9% versus 14.9% for the S&P500. This domination continued until the sec- 
ond election. Small caps returned 1.58% per month versus 1.31% per month for the 
S&P500 from November 1992 to October 1996. However, large cap S&P500 returns 
began exceeding small cap returns in 1994 and this continued through 1997. The 
January 1994 to December 1996 returns were small cap 1.36% per month versus 
1.50% per month for the S&P500. From November 1996 to December 1997 small 
caps returned 1.81% per month and the S&P500 2.44% per month. There was a 
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Table 1.17: Average Annual Returns for the First Year and Four Years of Democratic and Repub- 
lican Presidencies* to 1997 Update to 2010. 


January 1937— January 1929- January 1998- 
December 2010 December 2010 December 2010 


S&P500 Small Cap S&P500 Small Cap S&P500 Small Cap 


Democrat 

Avg 1% yr 8.79 13.08 11.86 19.87 26.46 27.19 
Avg 1% 2 yrs 8.74 12.65 10.19 16.07 23.49 17.16 
Avg last 2 yrs 14.10 21.11 15.32 21.83 5.97 9.12 
Avg Term 11.56 16.35 13.08 17.75 16.48 13.94 
Std Dev Term 16.21 26.79 17.95 29.71 15.19 15.46 
# years 38 38 42 42 5 5 
Republican 

Avg 1% yr 0.68 —4.05 —0.19 —11.18 —3.49 3.52 
Avg 1% 2 yrs 4.69 1.35 2.48 —4.92 —3.32 1.23 
Avg last 2 yrs 12.14 14.86 7.78 9.70 2.02 7.56 
Avg Term 8.41 8.11 5.01 2.43 —0.65 4.39 
Std Dev Term 16.96 21.21 21.24 27.28 21.51 24.90 
Avg 15¢ yr 36 36 40 40 8 8 

# years 36 36 40 40 8 8 


phenomenal growth in S&P500 index funds and much foreign investment in large 
cap stocks during this period. While small caps had very large returns, those of the 
S&P500 were even higher. 

We investigated how inflation varied with political regimes. The results for 
the 1929-1997 period, using the Ibbotson inflation index, indicate that inflation 
was significantly higher under Democrats, but this difference was contained in the 
1929-1936 period. Excluding this early period, inflation was slightly higher, on aver- 
age, under Democrats but not statistically different from inflation under Republican. 
Inflation rates differed across the years of the presidential terms. For example, for 
the 1937-1997 period, in the first year of the presidential term, inflation under the 
Democrats was significantly lower than it was under the Republicans. An analysis of 
the first and second two years of administrations during this same period indicated 
that inflation was higher under Democrats but the difference was not statistically 
significant. 


US bond returns after presidential elections 


The bond data are also from Ibbotson Associates and consist of monthly, continu- 
ously compounded total returns for long term corporate bonds, long term (20-year) 
government bonds, intermediate (5-year) government bonds, and cash (90-day 
T-bills). 

Figure 1.24(a) illustrates average return differences for bonds during election 
months and the subsequent 13 months (1929-1997) minus each months 1928-1997 
average return. Figure 1.24(b) updates this to 2010. Corporate, long term govern- 
ment, and intermediate government bond returns were all higher than the monthly 
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Table 1.18: Average Monthly Small- and Large-Cap Stock Returns During Democratic and 
Republican Presidencies, January 1929 — December 2011. 


January 1929 to December 2011 


Democratic Republican Dem diff. 
S&P500 small-cap small-S&P S&P500 small-cap small-S&P Rep diff 
January 1.33 5.35 4.02 1.25 4.88 3.63 0.38 
February —0.37 0.92 1.29 0.92 1.86 0.94 0.36 
March 0.16 —0.43 —0.59 0.69 1.19 0.50 —1.08 
April 2.30 2.81 0.51 0.24 —1.12 —1.36 1.87 
May 0.75 0.91 0.16 —0.11 —0.72 —0.61 0.77 
June 1.49 1.60 0.11 0.16 —0.32 —0.49 0.60 
July 1.85 2.45 0.60 1.10 0.36 —0.74 1.34 
August 0.73 0.88 0.15 1.41 1.19 —0.22 0.38 
September 0.43 0.92 0.49 —2.81 —3.22 —0.41 0.90 
October 0.90 0.04 —0.87 —0.21 —2.02 —1.81 0.94 
November 1.39 1.56 0.17 0.67 0.01 —0.66 0.84 
December 1.81 2.24 0.43 1.45 0.21 —1.24 1.67 
January 1929 to December 1997 (Hensel and Ziemba, 2000) 
ga Poe O ea o EO O a eni di 
S&P500 small-cap small-S&P S&P500 small-cap smal-S&P Rep diff 
January 1.72 6.45 4.73 1.65 5.93 4.28 0.45 
February —0.38 0.74 1.12 1.59 2.78 1.19 —0.07 
March —0.58 —0.91 —0.33 0.96 1.21 0.25 —0.58 
April 2.25 2.58 0.33 —0.24 —1.82 —1.58 1.91 
May 1.07 1.40 0.33 —0.50 —1.52 —1.02 1.35 
June 1.57 1.71 0.14 0.78 —0.40 —1.18 1.32 
July 1.95 2.81 0.86 1.69 1.11 —0.58 1.44 
August 1.17 1.65 0.48 1.73 1.25 —0.48 0.96 
September 0.40 0.78 0.38 —2.87 —3.31 —0.44 0.82 
October 0.42 —0.24 —0.66 —0.40 —2.66 —2.26 1.60 
November 1.44 1.61 0.17 0.44 —0.53 —0.97 1.14 


December 1.56 1.58 0.02 1.59 —0.09 —1.68 1.70 


Jan 1998 to Dec 2011 


sae ae SNE E eta 

S&P500 small-cap small-S&P S&P500 small-cap small-S&P Rep diff 
January —1.57 —2.82 —1.25 —0.37 0.67 1.05 —2.30 
February —0.32 2.27 2.59 —1.75 —1.81 —0.07 2.65 
March 5.62 3.13 —2.50 —0.37 1.11 1.48 —3.97 
April 2.66 4.54 1.88 2.14 1.67 —0.47 2.35 
May —1.61 —2.70 —1.09 1.46 2.50 1.04 —2.13 
June 0.90 0.81 —0.09 —2.31 —0.02 2.29 —2.37 
July 1.13 —0.20 —1.33 —1.24 —2.63 —1.39 0.06 
August —2.51 —4.79 —2.27 0.15 0.95 0.80 —3.07 
September 0.67 1.99 1.32 —2.59 —2.86 —0.28 1.59 
October 4.48 2.08 —2.40 0.54 0.52 —0.02 —2.38 
November 0.99 1.20 0.21 1.60 2.17 0.57 —0.36 


December 3.63 7.13 3.50 0.88 1.40 0.52 2.98 
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Figure 1.23: Cap Size Effects and Presidential Party, Democratic (Small Minus Large) Minus 
Republican (Small Minus Large). 


average in the year following an election only in May, October and November in the 
1928-1997 period. Both government bonds also exceeded the average in some other 
months. The update only has three elections and the monthly pattern is different 
than it was in the past. 

As Table 1.19 indicates, the performance of fixed income investments differed 
significantly between Democratic and Republican administrations. All fixed income 
and cash returns were significantly higher during Republican than during Demo- 
cratic administrations during the two study periods. The high significance of the 
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Figure 1.24: Stock Monthly Return Differences: Presidential Election Months and the Subsequent 
13 Months Minus Monthly Averages. 
Source: Hensel and Ziember (2000b) 


cash difference stems from the low standard deviation over terms. The performance 
of fixed income investments differed very little between the first two years and the 
last two years of presidential terms. 

The distribution of Democratic and Republican administrations during the 
1929-1997 period played a part in the significance of the fixed income and cash 
returns. As Table 1.19 indicates, the cash returns for the first four Democratic 
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Table 1.19: Annualized Average Monthly Return.* 


Term President Party Bonds Cash Bonds-Cash 
1929-1932 Hoover Republican 4.61 2.26 2.35 
1932-1936 Roosevelt Democratic 5.05 0.20 4.85 
1937-1940 Roosevelt Democratic 3.73 0.08 3.65 
1941-1944 Roosevelt Democratic 1.74 0.25 1.49 
1945-1948 Roosevelt /Truman Democratic 1.48 0.50 0.98 
1949-1952 Truman Democratic 1.24 1.35 —0.11 
1953-1956 Eisenhower Republican 1.19 1.66 —0.47 
1957-1960 Eisenhower Republican 4.24 2.54 1.70 
1961-1964 Kennedy/Johnson Democratic 3.21 2.84 0.37 
1965-1968 Johnson Democratic 2.76 4.43 —1.67 
1969-1972 Nixon Republican 7.06 5.19 1.87 
1973-1976 Nixon/Ford Republican 7.42 6.25 1.17 
1977-1980 Carter Democratic 3.17 8.11 —4.94 
1981-1984 Reagan Republican 13.71 10.39 3.32 
1985-1988 Reagan Republican 10.35 6.22 4.13 
1989-1992 Bush Republican 10.77 6.11 4.66 
1993-1996 Clinton Democratic 5.74 4.30 1.44 
1997—2000 Clinton Democratic 5.77 5.07 0.69 
2001-2004 Bush Republican 3.69 1.84 1.85 
2005-2008 Bush Republican 4.00 3.40 0.61 
2009-2011 Obama Democratic 2.82 0.17 2.65 


Source: updated from Hensel and Ziemba (2000) 

*From 1998-2011 we used the 3-month T-bill secondary market rate discount basis for cash and 
market yield and U.S. Treasury securities at 5-year constant maturity, quoted on investment basis 
for bonds. 


administrations in this period (1933-1948) were very low (0.20%, 0.08%, 0.25% and 
0.50% annually). This result largely explains why the term cash-return differences 
are so significant (t-value = —12.31 for 1929-1997). Democratic administrations were 
in power for three of the four terms during the 1941-1956 period, when government 
bonds had low returns. Bond returns in the 1961-1968 period (both Democratic 
terms) and 1977-1980 period (Democratic) were also low. 


Some simple presidential investment strategies 


Two presidential party based investment strategies suggest themselves. The first is 
equity only and invests in small caps with Democrats and large caps with Repub- 
licans; the second, a simple alternating stock-bond investment strategy, invests 
in small cap stocks during Democratic administrations and intermediate govern- 
ment bonds during Republican administrations. The test period was January 1937 
through December 1997 with an update from 1998 to 2010. 

The common 60/40 (large cap/bonds) portfolio investment strategy and provides 
a benchmark for comparison with the two strategies. Transaction costs were not 
included, but they would have a minor effect on the results because the higher return 
presidential strategies trade at most every four years. These investment strategies 
all lost money until the early 1940s, see Table 1.20 which shows the cumulative 
wealth. 
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Table 1.20: Value of $1 Initial Investment in 1997 and 2010. 


Large Cap Small Cap Presidential Presidential 60/40 
Date (S&P) (SC/Int) (SC/LC) Benchmark 
Jan 1937—Dec 1997 346.1 453.2 527.9 963.2 140.5 
Jan 1942—Dec 1997 639.0 2044.1 2380.9 4343.8 180.9 
Jan 1937—Dec 2011 574.7 919.5 1255.9 1349.2 247.0 


Jan 1942—Dec 2011 1061.1 4147.1 5664.2 6084.6 318.0 


Source: Hensel and Ziemba (2000), updated 
*Russell 2000 Index TR after December 1997 


The two presidential investment strategies performed well over the sample 
period. The strategy of investing in small cap stocks during Democratic admin- 
istrations and large cap stocks during Republican administrations produced greater 
cumulative wealth than other investment strategies. The alternating stock-bond 
strategy of investing in small cap stocks under Democrats and intermediate bonds 
under Republicans produced the second highest cumulative wealth. Both of these 
presidential party based strategies had higher standard deviations than large cap 
stocks alone during the 1937-1997 period. Clinton’s first administration had returns 
for small and large cap stocks, bonds, and cash consistent with the past. However, 
in the first fourteen months of his second administration large cap stocks produced 
higher returns than small cap stocks. 

In the update in Table 1.20 we see that, for the 1942-2011 period, small cap 
stocks (Russell 2000 from 1998) produced about four times the gains of large cap 
S&P500 stocks (4147.1 versus 1361.1). But the small cap with Democrats and large 
cap with Republicans was even higher at 6084.6. Meanwhile, the 60/40 portfolio 
was at 318.0 about an as much!. 

Table 1.21 displays the mean returns and standard deviations for the various 
subperiods for the various strategies. 


Remarks 


An interesting finding of this study was the much higher small-stock returns dur- 
ing Democratic administrations as compared with Republican administrations. This 
finding is consistent with the hypothesis that Democrats devise economic policies 
that favor small companies and consequently, their stock prices. The 33.51% differ- 
ence between small stock performance in Democratic and Republican administra- 
tions in the first year in office and the 18.21% difference for the full four-year term 
are very large. In 2011 up to the end of July, the Russell 2000 small cap index has 
moved above its 2007 high but the S&P500 has not. So the small cap advantage 
with a Democratic president is continuing. 

This political party effect is different from the well-known January small 
firm effect which has been present for Republicans as well as Democrats. We 
found in addition a substantial small stock/large stock differential outside of 
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Table 1.21: Average Returns and Standard Deviations for Different Investment Strategies for 
Different Investment Horizons. 


Strategies 
4 Large-Cap Small-Cap Pres (SC/Int) Pres (SC/LC) 60-40 
Dates years Mean StDev Mean StDev Mean StDev Mean StDev Mean StDev 
Jan 1937—Dec 1997 61 10.9 15.8 13.3 24.5 12.6 20.7 14.1 22.8 86 10 
Jan 1938-Dec 1997 60 11.8 15.5 14.9 23.7 14.3 19.8 15.8 22 9.2 9.8 
Jan 1948-Dec 1997 50 12.3 14 13.9 19.2 13.1 12.7 14.9 16.5 97 9 
Jan 1958—Dec 1997 40 11.6 14.2 14.7 20.2 14.6 13 15.6 17 9.7 9.3 


Jan 1968—Dec 1997 30 11.4 15.1 12.7 21.5 143 12.4 143 17.5 10.1 9.9 
Jan 1978—Dec 1997 20 15.4 14.7 163 195 159 13.8 17.2 17.9 12.9 9.9 
Jan 1988—Dec 1997 10 16.6 11.9 15.2 149 13.7 99 16.2 13.2 13.2 8.2 
Jan 1993—Dec 1997 5 18.4 106 17.7 13.3 17.7 13.3 17.7 13.3 13.5 7.4 
Jan 1995—Dec 1997 3 27.1 11.2 22.1 15.1 22.1 15.1 22.1 15.1 19.7 7.6 


Jan 1993—Dec 1993 1 9.5 6.1 19 9.4 19 9.4 19 9.4 10 4.3 
Jan 1993—Dec 1994 2 5.4 8.5 11.1 9.8 11.1 9.8 11.1 9.8 4.3 6.5 
Jan 1995—Dec 1996 2 26.3 8.4 22.9 14.2 22.9 14.2 22.9 14.2 19.3 5.9 
Jan 1997—Dec 1997 1 28.8 15.8 20.5 17.6 205 17.6 205 17.6 20.5 10.5 


Jan 1937—Dec 2010 74 8.9 16 12 23.5 11.9 20.1 12.5 22 7.9 11.7 
Jan 1938—Dec 2010 73 9.6 15.7 13.3 22.8 13.2 19.3 13.9 21.2 8.4 11.6 
Jan 1948—Dec 2010 63 9.7 146 12.3 18.7 12.1 13.7 12.8 16.5 8.7 11.5 
Jan 1958—Dec 2010 43 10.7 14.9 15.5 19.5 16.1 141 16 16.9 10.4 12 

Jan 1968—Dec 2010 33 10.2 15.6 14 20.3 16.3 14 149 17.3 11 12.7 
Jan 1978—Dec 2010 23 13.2 15.5 17.7 18.5 185 15.1 17.7 17.4 13.9 13.4 
Jan 1988-—Dec 2010 13 12.4 15 17.8 16 18.9 145 17.3 154 149 14.3 
Jan 2003—Dec 2010 8 7.7 15.2 12.4 20.5 9.7 13.3 9.3 17.4 5.9 9.1 
Jan 2008—Dec 2010 3 —0.3 16.7 6.2 21.9 8.3 15.7 4.9 19.7 4.7 10 


Source: Hensel and Ziemba (2000) and update to 2010 


January during Democratic rule (see Table 3). Large stock returns were sta- 
tistically indistinguishable between Democrats and Republicans, but bond and 
cash returns were significantly higher during Republican than during Demo- 
cratic administrations. We also confirmed and updated Huang’s finding that large 
cap stocks have had higher returns in the last two years of presidential terms; 
this finding applies regardless of political party and for both small and large 
cap stocks. 

A study of the differences in economic policies that lead to the divergence of 
investment results according to which political party is in office would be interesting. 
Clearly, candidates seeking reelection are likely to favor economic policies that are 
particularly attractive to the public; and those policies are consistent with higher 
stock prices. Cash returns did not differ significantly between the first and second 
two-year periods of Democratic and Republican presidential terms. 


1.9.3 Election cycles: Other literature 


Stoval (1992) and Hensel and Ziemba (1995, 2000) documented the Presidential 
Election Cycle effect, which exhibited that stock markets generally had low returns 
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during the first two years after an US Presidential election and high returns during 
the last two years. Other subsequent studies have documented the economically and 
statistically significant difference in equity returns during the first and second half 
of Presidential terms for Republican and Democratic administrations. Some studies 
use more detailed models. Wong and McAleer (2008) examine the cyclical effect that 
Presidential elections have on equity markets using a spectral analysis technique 
and an exponential GARCH Intervention model to correct for time-dependence 
and heteroskedasticity. They consider the period from January 1965 to December 
2003 using weekly data with dummy variables to designate the year of the term and 
the President’s party. 

Wong and McAleer find a cyclical trend that mirrors the four-year election 
cycle with a modified cycle of between 40-53 months. They find that stock prices 
generally fall until a low-point during the second-year of a Presidency and then 
rise during the remainder, peaking in the third or fourth year. During the current 
Obama Democratic administration, the low was in March 2009 in his first year and 
the market has doubled since then to the end of March 2011. Wong and McAleer 
also find this Presidential Election Cycle effect to be notably more significant under 
Republican administrations, leading them to posit that the Republican Party may 
engage in policy manipulation in order to benefit during elections relatively more 
than their Democratic counterparts. For instance, the second-year and third-year 
effect estimates are not significant for Democratic administrations. 

Wong and McAleer explain the Presidential Election Cycle as follows. During 
the first year of a Presidency, voters are on average optimistic, and Presidents are 
likely to put their most divergent and expensive new policies in place, because they 
have the mandate of the voters and re-election time is furthest away. These early 
measures are relatively disadvantageous to business profits and stock prices because 
they usually involve higher taxes and spending and possibly new regulations. Then, 
during the second year of a term, Presidents begin to alter their policies to ones 
that are less drastic and more voter-friendly. 

The Presidential Election Cycle effect persists when looked at by President and 
by party. For instance, the only two Presidents who did not exhibit the cycle effect 
were Ronald Reagan and Bill Clinton during their second terms, during which they 
would not have re-election incentives like first-term Presidents. Empirical results 
also find that Republicans who were subsequently re-elected had a positive effect 
during the second-year of their term instead of the negative effect expected by the 
Presidential Election Cycle hypothesis. This suggests these Republicans may have 
used government policies to their favor to win re-election and should be useful for 
incumbent Presidents to consider in their electoral strategy. This last conclusion, 
however, does not follow from the conflicting observation that bull markets have 
tended to coincide with sub-periods under Democratic administrations. Wong and 
McAleer conclude that this anomaly was present during most of the last forty years 
and is likely still present in the market. 
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1.10 Turn-of-the-month Effects 


Historically there have been high returns for both large and small cap stocks around 
the turn-of-the-month (TOM). Market advisors such as Merrill (1966), Fosback 
(1976) and Hisrch (1986) have argued that stocks advance at the TOM. Ariel (1987) 
documented this for the U.S. using equally and value weighted indices of all NYSE 
stocks from 1963-1981; see Figure 1.25a, b. 
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Figure 1.25: The U.S. Turn of the Month Effect, Mean Daily Percent Returns on Trading Days 
—9 to +9, 1963-1981. 


Source: Ariel (1987) 
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The five days —1 to +4 historically were the TOM and had a large amount of 
the monthly gains. Indeed that period and the second week actually had all the 
monthly gains. Ariel (1987) found the following portfolio gains 


Equally weighted Value weighted 


First half of trading month 2552.40% 565.40% 
Last half of trading month —0.25% —33.80% 


Nineteen years 2545.90% 339.90% 


Lakonishok and Smidt (1988) found that during the ninety year period, 1897-1986, 
the large capitalized Dow Jones industrials rose 0.475% during the four day period 
—1 to +3 each month whereas the average gain for a four day period was 0.061% 
with an increase in prices over 56% of the time. The average gain per month over 
these 90 years was 0.349%. Hence, aside from these four days at the turn of the 
month, the DJIA actually fell. 

The effect has continued in recent years even in the presence of index futures 
contracts which began trading in the U.S. in 1982. Hensel, Sick and Ziemba (1994) 
found for the period May 1982 to April 1992 using the S&P500 large cap and Value 
Line small cap indices, consistent with the previous evidence that about two thirds 
of the months gains occur on trading days —1 to +4, the turn of the month, and the 
rest of the months gains occur on trading days +5 to +9 so that all or more than 
all of the gains occurs in the first half of the month. The second half was at best 
noise. The effect was monthly dependent with the largest gains in January and size 
dependent with the small capitalized value line index of about 1650 stocks having 
higher means and lower standard deviations than the large capitalized S&P500 
index. There was partial anticipation in the futures market as shown in Figure 1.26. 
For the small capitalized value line index, the cash effect on day —1 was partially 
anticipated on days —4 to —2. Then the effect in the cash market on days +2 and 
+3 was partially anticipated on day +1. Hence, the cash market effect on days 
—1 to +4 was as Ariel found for the 1963 to 1981 data with small gains on days 
—4 to —2. For the large capitalized S&P500 index, the results were similar except 
that there are higher returns in the cash market in the anticipation period (—4 to 
—2) and lower returns in the —1 to +4 period. 

The reasons for the turn of the month effect are several but they are largely 
cash flow and institutionally based. See Hensel, Sick and Ziemba (1996) and Gon- 
zalez (2006) who discusses that paper. The U.S. economy uses a system where 
much money is paid on the —1 day such as salaries and bills and debt payments. 
In addition to some of this money being invested in the stock market, there are 
institutional corporate and pension fund purchases at that time. These cash flows 
vary by month and lead to higher average returns in January which has the highest 
cash inflow. Ogden (1990) presents some empirical support of this hypothesis and 
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Figure 1.26: Mean Percentage Daily Returns in the Cash and Futures Market for Small and Large 
Capitalized Stocks by Trading Day of the Month, May 1982 — April 1992. 


Source: Hensel, Sick and Ziemba (1994) 


related monetary actions for U.S. markets. Another factor in this effect seems to 
be behavioral. One manifestation is that bad news such as that relating to earnings 
announcements is delayed and announced late in the month while good news is 
released at the beginning of the month; see Penman (1987). Bouges, Jain and Puri 
(2009) show a turn of the month effect in the S&P ADR. 
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The TOM was similar in Japan except that the dates change with the turn being 
days —5 to +2, with +3 to +7 being the rest of the first half of the month. Ziemba 
(1989, 1991) investigated this. The reasons for the effect in Japan seem to be: 


e Most salaries were paid during the period of the 20-25 day of the month, with 
the 25t! being especially popular. 
There was portfolio window dressing on day —1. 


Security firms could invest for their own accounts on amounts based on their 
capitalization. Since their capitalization usually rises each month and is computed 
at the end of the month, there is buying on day —3 to account for this. Buying 
was done as soon as possible. 

Large brokerage firms had a sales push that on day —3 and lasted 7 to 10 days. 
Employment stock holding plans and mutual funds received money in this period 


to invest, starting around day —3. 

Individual investors bought mutual funds with their pay, which they received on 
calendar days 15 to 25 of the month; the funds then invested in stocks with a lag, 
so most of the buying occurred on days —5 to +2. 

For low liquidity stocks, buying occurred over several days by dealing in accounts 


to minimize price pressure effects. 


Using data on the NSA from 1949 to 1988 Ziemba (1991) found that all of the days 
—5 to +2 had significantly positive returns. As in the U.S. all the gains occurred in 
the first half of the month and the second half had zero or negative returns. 

Ziemba (1989) investigated the futures market trading outside Japan on the 
Simex in Singapore on the turn of the month and other anomalous effects in 
Japanese security markets during the period September 1986 to September 1988 
before there was futures trading on the NSA or Topix in Japan. He found that the 
spot effect was consistent with past data so the futures market did not alter the 
effect. However, the futures market in Singapore totally anticipated the effect on 
days —8 to —5 with a total average rise on 2.8%. Then when the effect occurred on 
days —5 to +2 and the spot market gained 1.7%, the futures market was flat. 

In our update for the U.S. markets using S&P500 and Russell 2000 futures data 
from 1993-2010 and 2004-2010, we found that the TOM effect still exists with a 
bit of anticipation. Figures 1.27ab and 1.28ab and Tables A.8abcd document the 
results. For example, for the S&P500, for the longer sample and also for the shorter 
more recent data, the days —5 to +2 all have positive returns except —1 and —2 
(which have small mean losses). For the Russell 2000 the same days all have positive 
mean returns for the longer sample and for the more recent data with —2 having a 
slightly negative mean. 


1.11 Open/Close Daily Trade on the Open 


Branch and Ma (2006) analyzes an anomaly that may contradict the weak form of 
the efficient market hypothesis. This hypothesis holds that a time series of returns 
should not contain meaningful autocorrelation, or, in other words, that a stocks past 
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Figure 1.27: S&P500 Futures Average Daily Returns During TOM by Day. 


returns should not have predictive power about future returns. In contradiction, 
Branch et al. find a very strong negative autocorrelation between the overnight 
return (between the close of the market and its opening the next day) and the 
intraday return (the portion that occurs during the day while the market is open). 
The study analyzes stocks on the NYSE, AMEX, and NASDAQ over two periods 
between 1994 and 2005, as well as broken down into size categories, and finds 
statistically significant results across each sub-sample. Branch et al. goes further to 
hypothesize that the cause of this anomaly is related to the behavior of specialists 
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Figure 1.28: Russell 2000 Futures Average Daily Returns During TOM by Day. 


or market makers and their strategies and incentives on how to open their assigned 
stocks relative to the previous days closing price. 

Branch et al. also consider whether there is a strategy that exploits this anomaly 
but offer only initial conclusions. According to their explanation of the basis of the 
anomaly, they reason that only market makers are currently able to exploit this 
anomaly because they have the benefit of knowing the balance of overnight orders. 
For the trading public, their main actionable conclusion for this analysis is advice 
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against a public trader placing an order to be executed at opening, which they 
equate to putting in an order at a disadvantageous price. 

Cooper, Cliff and Gulen (2008) exhibits that the US equity premium return over 
the decade from 1993 to 2006 has solely been a result of overnight returns, with 
intraday returns being close to zero. While past studies had shown that daily market 
closures had a relatively clear effect on trading volume and stock price volatility, the 
implications as far as return timing did not carry a consensus. Cooper et al. offer 
strong evidence supporting the hypothesis that the majority of returns are made 
when markets are closed. The find the difference between night and day returns 
is between 2.61 and 7.61 basis points per day and that these results are robust 
across asset types, sub-periods, and markets (including NYSE, AMEX, NASDAQ, 
and Chicago Mercantile Exchange). The study finds that typical explanations such 
as risk, earnings surprises, and illiquidity do not substantially explain this pattern 
and instead imply there is an inefficiency in market opening and closing mecha- 
nisms. Particularly they suggest this may carry tradable implications for portfolio 
managers, particularly those with low marginal trading costs. 


1.12 Industry Concentration 


Hou and Robinson (2005) analyze the asset pricing implications of industry market 
structure by running regressions of average monthly returns on industry concentra- 
tion from 1963 to 2001. They conclude that firms in the least concentrated industries 
(18* quintile) earn 4% per year higher returns than firms in the most concentrated 
(5*" quintile). The greater risk is called the concentration premium and is described 
by innovation and distress risk. The metric used to quantify industry concentration 
is the Herfindahl index, a relative measure of industry market share dilution, as 
defined by the three-digit SIC classification. They also find that the concentration 
premium is greater in industries with a higher book-to-value ratio. 

Investors could possibly set up a long-short strategy that is long in firms or 
industries with low concentration and short in firms or industries with high concen- 
tration. The 4% advantage is not that great so given transaction costs, this edge 
might be combined with other strategy edges. 


1.12.1 Weather: Sun, rain, snow, moon and the stars and clouds 


Hirschleifer and Shumway (2003) point out that psychologists have documented 
correlation between sunshine and behavior for decades. Sunshine has been linked to 
tipping (Rind, 1996) and lack of sunshine to depression (Eagles, 1994) and suicide 
(Tietjen and Kripke, 1994). People feel good when the sun shines and when they 
feel good they are more optimistic and may be more inclined to buy stocks thus 
leading to higher stock prices. 

Roll (1992) argues that 


“weather is a genuinely exogenous economic factor... it was a favorite example of an 
exogenous identifying variable in the early econometrics literature ... because weather 
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is both exogenous and unambiguously observable... weather data should be useful 
in assessing the information processing ability of financial markets.” 


See also Trombley (1997), Kramer and Runde (1997) and Loughran and Schulz 
(2004). 

Hirshleifer and Shumway (2003), following the earlier paper by Saunders (1993), 
examine the relation between morning sunshine at a country’s leading stock 
exchange and market index stock returns that day at 26 stock exchanges interna- 
tionally from 1982-1997. They find that sunshine is strongly positively correlated 
with daily stock returns. After controlling for sunshine, other weather conditions 
such as rain and snow are unrelated to returns. Keef and Roush (2007), using data 
on 26 international stock exchanges show the not surprising result that the sun- 
shine effect is monotone stronger the further one is from the equator and the per 
capita GDP. There is no effect at the equator and a big effect on northern stock 
exchanges. An interesting question for future research, that we are working on in 
a forthcoming paper, is does the sunshine effect have anything to do with “sell in 
May and go away” that suggests being out of the stock market in the sunniest time 
of the year? 

Yuan, Zheng and Zhu (2006) investigate the relationship between lunar phases 
and stock market returns in 48 countries. Stock returns are lower on the days around 
a full moon compared to the days around a new moon. The return difference is 3-5% 
per year based on equal and value weighted global portfolios. The result is not due 
to changes in stock market volatility or trading volumes. Also the lunar effect is not 
explained by announcements of macroeconomic factors or major global shocks, and 
is independent of calendar anomalies such as January, day of the week, calendar 
month or holiday effects. 


1.13 Conclusions and Final Remarks 


In the main, the anomalies are still there with some moving around. In the past, 
some of the anomalies such as the TOM and January effect had very high prediction 
accuracy. Currently the January barometer and sell in May and go away which deal 
with longer range predictions have similar reliability. Other anomalies such as the 
January and holiday effects still exist and add value. The monthly effect has become 
noise and has no predictive value. 

How did the anomalies do in practice this year? 

The TOY trades are in addition to the predominately S&P500 and Russell 
2000 option biases strategies but have added value and were just used in December 
2011. Dzhabarov and Ziemba have run a private account with various anomalies — 
seasonal and option biases, including the TOY trades in December 2009, 2010 and 
2011. The TOY effects are shown in the graph in Figure 1.29. The TOY trades can 
be seen at the far left and far right and in the middle of the wealth graph. 

What did Ziemba learn from Hirsch and Hirsch (2011)? 
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e=e\VTZMI Trading NAV Net of Commissions and 2% + 20% Fees 


—e-Mini S&P 500 futures B&H NAV 


start Dec 1, 2009 $100,000 
as of Dec 31, 2011 $147,795 


Figure 1.29: Private Account Managed by C. S. Dzhabarov and W. T. Ziemba, December 1, 2009 
to December 31, 2011. 


The following are some conclusions from the 2011 edition of the Stock Trader’s 
Almanac plus comments regarding how this compares to our own research. (From 
“The best of times for stocks may be ahead,” CNBC, October 26, 2010). 


1. The markets sweet spot during any four-year presidency cycle is historically the 
fourth quarter of the midterm year and first quarter of the pre-election year. 
Also there has not been a down third year of a presidential term since prewar 
1939 when the Dow Jones Average fell —2.3%. This agrees with our Presidential 
election calculations. 

2. History also shows the markets have historically performed the best when there 
is a Democrat in the White House and a GOP-controlled Congress, which is what 
we now have. Over the last 70 years, in fact, such combinations on Capitol Hill 
produced an average annual return of 15% or more. 

This is consistent with our calculation but we did not specifically investigate 
Democrat in White House with a Republican Congress. 

3. The months between November and April are generally good to investors. Since 
1950, the Dow Industrials have returned an average 7.4% between those months, 
versus less than 1% (0.4%) between the May—October period. A hypothetical 
$10,000 investment during the best six months of the year over that 60-year 
period would have yielded a $527,388 profit, compared with a $474 loss for the 
worst performing six months of the year during the same period. 

This is the basic yearly seasonality and it agrees with our calculations. 

4. They argue that there would be no January barometer without the passage in 
1933 of the 20° (Lame Duck) Amendment to the constitution. Prior to the 
1933 Congress and the President took office in March following an election and 
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Congress did not need to meet until December. So the period from the Novem- 
ber election to March created a four month Lame Duck. With the amendment 
congressional sessions begin on January 3°¢ and presidents are inaugurated on 
January 20°. This narrows the Lame Duck session and moves many important 
things into January including the president’s state of the union address and 
annual budget setting priorities for the year and thus creating a momentum for 
the rest of the year. 


. From Yahoo! Finance, October 20, 2010: 


If history is any guide, the secular bear market that began in 2000 still has about 
seven more years before the bulls take over. And when they do, get ready for 
what Hirsch describes as a “super boom” in stocks. 

“The super boom’s not really going to kick off until 2017 after we shake out all 
this financial crisis,” according to the Hirsches. They predict that the Dow will 
hit 38,820 in 2025, a 500% gain from the intraday lows of March 2009. 

“From the last bottom in 1974 it took eight years before the market really took 
off in 1982 and then another eight to move up the rest of the 500%, in line with 
Yale Hirsch’s prediction in 1976 for a 500% market move by 1990. A 500% rise 
in the Dow over 16 years from the intraday low of 6470 on March 6, 2009 would 
put the Dow at 38,820 in 2025.” 

It’s more than just wishful thinking — it’s a return to the market’s norms. It 
only takes a 7%-8% gain, compounded annually, to get to 38,820, they note. In 
fact, the Dow rose 1,400% during the bull market of 1982 to 2000. 

What, then, will drive this super boom? 

First, the end of war brings peace and a generally pro-business environment. It 
happened after World War II and Vietnam, and the Hirsches bet that it will 
happen after the wars in Iraq and Afghanistan. 

Second, inflation. “After all the major wars of the 20th century,” stocks took off, 
“when the inflation from government spending kicks in.” 

Third, enabling technologies. They point to the assembly line after World War 
I, which revolutionized business; suburban expansion after World War II that 
created a middle-class consumer in need of appliances and more homes; and 
after Vietnam it was IT and communications that sparked innovation. Renewable 
energy and biotechnology, they believe, will be the next big thing. 


Well, we will see. This is the story of compounding. So if earnings grow by 7-8% 
per year, the stock prices may well have these gains. 


Table A.1: Russell 2000 —S&P500 Futures Spread Average Returns by Month, 1993-2011. 


MOY Trading R2000-S&P500 futures spread. Geometrically Linked Returns. 
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Table A.2: 


S&P500 Futures Average Monthly Returns, 
MOY Trading in S&P 500 futures contract. Geometrically Linked Returns, 


1993-2011 and 2004-2011. 
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Table A.3: 


Russell 2000 Futures Average Monthly Returns, 1993-2011 and 2004-2011. 
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TOM Trading in S&P 500 futures. Geometrically Linked Returns. 


Table A.4: S&P500 Futures Turn-of-the-Month Returns, 1993-2011 
M Trading in S&P 500 futures. Geometrically Linked Returns. 
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Table A.5: Russell 2000 Futures Turn-of-the-Month Returns, 1993-2011 and 2004-2011. 
TOM Trading in Russell 2000 futures. Geometrically Linked Returns, Entry on close 5th Business Day before End of Month. 
Exit on close 2nd Business Day of Next Month. 


193| 1994| 1995| 199%., 1997| 19%, 199| 2000| 2001. 2002| 2003| 2004| 2005 2006 2007| © 2008) 2009, 2010 2011 Average [StDev t -A 
Jan 0.99% | 0.25%| -0.77% 2.01%) 3.21% 4.04%) -1.51%| 3.73% 2.20%] 0.89%) 2.91%) -2.67% 0.22% 1.20%] 5.90%) 681%) 051%| -0.79% 0.96% 0.028 1.441, 
Feb 259%| 0.03% 206% O71%| 411% -0.67%| -253%) 0.11% -183%| 0.91%) 4.04%] 4.05%, 0.85% 3.19%] 4.62% 0.45%] -0.62%| 213% 0.90% 0.024) 1.607) 


1.70%| 025% -1.23%| 1.31%) -1.28%| 5.32%| -0.67% 485%) 0.72%) 3.26%| 3.05%) 1.09% -7.03%| 4.37%] -1049%| 252%] -041% -0.08% 0.039) -0.087) 
0.28%} 1.56% -4.00%| 216%) 2.40%| -12.96%| -5.56% -0.15%| 0.93%) 5.75%] -0.15% 1.00% 048%| 1.08%| 581%| 0.72%) 3.23% -0.16% 0.043] -0.163) 
0.70% 0.54% 5.54%| 0.72% -1,57%| 10.28%] 6.07% 1.19%| 381%| 4.17%] 0.66% 0.19% -0.20% 1.98% 5.25%) -3.67%| 0.13% 1.72% 0.035, 2,139 
O74%| 0.61% 085%| -3.16%| -085%| 10.05%) -081% 4.28%] 6.15%) 3.83%] 214% 137% 345%] 1.85%] 10.32%] 297%] 1.21% — 1.92% 0.038) 2.208) 
0.96%| -0.75% 0.30%! 0.97%) 2.71%] 0.63%] 239% -6.26%| 4.50%) 0.76%] 4.20% 4.37% 2.26%| 5.28% 0.77% -6.06%| 5.83% 0.67% 0.034) 0.855) 
0.67%] 380% 1.64%] 842%) -1.48%] 3.21%) 254% -0.76%| 1.23%] 1.69%] 211%) 045% -336%| 068%] 3.83%) -1.07%| -8.18% -0.52% 0.034] 0.009) 
0.87%! 083% 222%] -9.71%) -298% 2.33% 3.61% 6.62% 467%| 246%] 080% 338% 047%| 277%| 4.76%] 444%] 0.66% -0.03% 0.038) -0.031' 
268%] 1.51% 221%] 6.16% 265%) -203%| 230%) 1.61%) -046%| 4.11%] 0.20%) -143%] 285% 8.92%| -350%| 0.18%] -386% -0.58% 0.034) -0.742 
Nov 1.51%| 1.42%| 1.90%| 0.80%. 7.21%| 3.68% 3.50%| 6.84%) -140% 342%] 438%| 206%| 210%| -273% -1.70%| 22.54%| -4.63%| 0.86%] 2.20% 276% 0.057) 2.125 
Dec) 1.88%] 1.06%] 4.23%| 226% -0.49%| 046%) 1.41%| -4.75%| 1.74%) 048%] 264%| 1.79%| 1.11%| 072%) 262%| 065%| 061%| 213%| 878% 1.49% 0.025 2581] 


Average 1.48% 0.42% 0.69% 0.78% 1.41% -0.98% 066% 0.60% 0.58% -051% 221% 1.70% 1.36% 0.79% 0.35% 086% 087% 0.24% 0.9% 0.75% 
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Oct) 2.15%) -1.81% 
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t 3.39%| 0.736| 1.463| 1818 1.672) -0.711, 0961| 0.309) 0.617) -0.486| 3.100) 1.959) 2347 1471 0.394) 0.374| 0.508) 0.290| 0.731 3.033) 
Geomr | 15.68%} 4.97% | 8.43%) 9.66% 17.80%) -1229% 7.86%) 4.79%) 6.52% -6.68%| 29.52%) 21.85%| 17.31% 9.70% 3.77%| 7.35%) 8.85%) 247%) 10.38% 
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Table A.6: Data by Month for Sell in May and Go Away Versus Buy and Hold for the S&P500, 1993-2011. 
SIM Trading in SP500 futures contract. Geometrically Linked Returns. Entry at Close on 6th Business Day before End of October, Exit Close of 1st Business Day of May. 


T 


2003 2004 2008) 2009| 2010, 2011/Average StDev t 
Jan 149 69%| O. 2.98% 1.82% ~682%| -9.14%| 3.72%) 230% -0.28% 0.041 
6.30%| 3.76% 173%) 1.28% -3.63%| -11.13%| 2.98% “1.21% 0.083 
5.65%] 4.49% 044% -182% -0.85%| 7.28%] 557% 1.51% 0.042 
0.70%) 319% 8.00% -1.73% : 455%] 9.07%| 147% 1.85% 0.040 
1.35%| 201% 0.02% 0.97% | 1.99% 0.71%] 1.29% 0.81% 0.007) 
0.49%) 0.43% 0.10% 0.09% 0.17%) 0.02%| 0.02%) 0.01% 0.28% 0.002 
047%] 041% 0.09% 011% 7 0.17%| 0.01%! 001% 
047%) 044% 0.08% 0.13% 0.16%| 0.01%) 0.02% 
0.45%) 043% 0.09%. 0.13% 0.16% 0.01%! 0.02% 
16%| -0.07% 
5.06%) 1.04% 0.75% 383% -9.22%| 588%| -0.11% 
| 180%] -1.93% 6.92%) 6.60% 482% 334% 027%) 1. -040%| 1.40%) 6.18% 
1.64% 115% 1. 2.60% 2.00% 0.96% 0.95% -0.24% 0.00% 1.14% 
ooa) 0.021! o. 0026| 0.029 0.029. 0.019 I 0.051] 0060| 0.027 
1.868 3.463) 2430|) 0. L149 1776 D160) -0001| 1.459 


14.40% | 35.62% | 26.34% ‘40% | 11.67% 1186%| 1. -4.19%| -202% 


ntract. Geometrically Linked Returns. 
1997. 1998) 1999 2003 2004 2008. 2009) 2010. 2011 Average StDev 
5.69%| 0.73% : -298% 1.82% -682%| -9.14%| -3.72%| 230% -0.28% 0.041 
0.28%| 6.30% 173%) 1.28% -3.63%| -11.13%| 2.98%] 335% -1.21% 0.043 
424%! 5.65% 0.44% -182% -0.85%| 7.28%| 5.57%| -0.51% 151% 0.042 


5.69% 0.70% “| 8: : : 9.07%| 147%) 289% 1.85% 0.040 
5.84%] -2.60% 96%] 5.27%| -8.41%| -1.21% 
452%| 461% 5.94%] -223% 0.040 
7.47%] -1.86% ; 734%| 6.80%) -216% 
5.91%) -15.59% 05% 4%] 0. 3.46% 4.69%) -6.40% 
551%| 6.86% ji f 3.16%| 831%| -7.85% 
378%| 738%) 573% 551%. 132% 00% | -20.11%| -207%| 3.74%) 10.51% 
3.21%| 506%] 1.04% 0.75% 3.83% -922%| 588%] -0.11%| -0.61% 
240%] 6.92%) 6.60% 482% 334%) 0: 1.80%) -0.56%| -0.40%| 140%] 6.18%] 037% 
2.22% 20% 140% 189% 0.72% 1.06% 0.21% -452% 1.67% 1.02% 
0.046| o066) 0.039| o. ; 0.033) 0.021 0.017| 0.029| 0.067) 0.064| 0.056 
1687| 1.063) 1.229 L97 1172 2135| 0257| -2321| 0.903 0.634 


33.54% | 19.34% | 28.75%| 23.93%) 17.18%| -1273%| -15.61%| -26.10%| 24.53% 8.76%| 285%| 13.29%| 2.11% | -44.22%| 19.28% 
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Table A.7: Data by Month for Sell in May and Go Away Versus Buy and Hold for the Russell 2000 Futures, 1993-2011. 


SIM Trading in Russell 2000 futures contract. Geometrically Linked Returns. Entry at Close on 6th Business Day before End of October, Exit 1st Business Day of May. 
T 


1994 1996, 1997 1998, 1999 2000 2011 Average StDev 


2.95%] -1.60%| -0.32%| 2.04%] -248%| 0.32%) -3.17% -041% -0.96% 0.047 
-143%| 4. L58% | -287%| 7.31%| -8.02%| 16.64% | 5.34% -0.34% 0.064 
443% 348%| -4.20%| 5.20%| 1.15%) -6.53% 208% 1.32% 0.043 
-0.55%| 23 4.59%| -0.10%| -0.13%| 847%] -7.92% 256% 188% 0.056 
1.01%) 1.65%] 0. 1.07%] 1.70%] 0.94%] 1.58%) 3.29% -131% 0.94% 0.010 
0.27%) 0.35%) 0. 040%| 047%) 049%/ 043%) 054% 0.01% 0.28% 0.002 
0.25%| 0.34%] 0: 049%| 047%) 0.47%| 041%) 0.56% 0.01% 0.29% 0.002 
0.27%) 0.40%) 0. 043%! 0.44%) 047%! 0.44%) 0.55% | 0.01% 0.29%  0.002/ 
0.25%) 0.39%] 0. 045%] 049%| 045%| 043%) 0.52%| 0.24%) 0.15%) O 0.01% 0.28% 0.002 
147%| 2.27%) 0.13%| -0.25%| 5.75% 1.82%| 3.11%] 5.39%] 214%] 0.12% 088% 1.88% 0.042 
4.28%) 4.75%) 4. 3.89%| -1.16%| 4.28%] 4.69%) -10.54%| 7.38%! 8.50% “1.00% 0.71% — 0.064 
412% 281%] 3. 201%] 2.05%| 6.76%| 1258%| 8.02%] 5.94%| -5.87% | 0.03% 3.47% 0.040 


Average -0.02% 0.00% 1. 1.49% 0.42% 213% 213% 0.61% 1.09% 0.67% 1 0.68% 0.85% 
StDev a 0.025 J 0.017) 0.025} 0.030} 0.050) 0.074] 0.043) 0.040 . . 0.018 
t : -0.029 


0.004) 2. 3.090 0579| 2432| 1.474) 0.286| = 0.866) 0.581 


Geomr | -0.57% | -0.31% 17. 19.18% | 4.83%| 28.12%| 27.12%) 4.49% 746% 


B&H Trading in Russell 2000 futures contract. Geometrically Linked Returns. 


1993, 1994 1995) 1996 1997) 1998; 1999 2000 2001 2011 Average StDev 
Jan 2.95% | -1.60%] -0.32%| 2.04%) -248%| 0.32%) -3.17%| 3.91% ji -0.41% -0.96% 0.047 
Feb| -4.20%| -1.43%| 4.17%| 1.58%| -2.87%| 7.31%| -8.02%| 16.64%| -7.33% 3 5.34% -0.40% 0.064 
259%] -4.43%| 138%| 3.48%] -4.20%| 5.20%| 1.15%] -6.53%] -3.92% 208% 132% 0.043 
3.05%! -0.55%| 239%| 4.59%] -0.10%| -0.13%| 8.47%) -7.92%| 6.33% 256% 188% 0.056 
4.00% | -1.16%| 2.21%| 3.17% 10.25%) 6.34%) 1.64%) -7.30%| 1.83% j| %| . -2.02% 0.80% 0.055 
151%|-349%| 4.64%] -3.66%| 4.75%| 1.13%| 4.66%) 9.34%] 3.00% -283% 0.44% 0.047 
055%) 2.00%) 528%] -932%| 4.13%) -9.34% 388%) 352%) -504% | 378% -1.39% 0.068 
3.95%| 5.42%) 144%) 5.01% 1.98% | -20.31% | 3.99%] 6.08% -3.70% i -10.27% -0.64% 0.065} 
247% | -0.58%| 182%| 4.17%] 7.70%] 8.02%| 0.36%] -1.99%| -13.82% “12.25% 0.15% 0.068 
2.96% | -0.40%| -5.11%| -1.50%| -513%| 3.75% 044%) -548%| 524% T414% -0.12% 0.078 
-128%| -4.75%| 442%| 3.89%] -1.16%| 4.24%| 4.69%] -10.54%| 7.38% “1.00% 0.71% 0.064 
4.12%| 284%| 3.13%) 201% 2.05% | 6.76%| 12.58% 8.02%| 5.94% -0.03% 3.47% 0.040 
0.97% -0.30% 2.02% 1.09% 1.62% -0.18% 1.53% -0.54% -0.02% -0.71% 0.45% 
0.033! 0.031) 0.029! 0.042| 0.047| 0.084! 0.056) 0.085) 0.066 . J I I . 0.069 
| 0.976) -0.329) 2.388] 0.896] 1.205] -0076| 0.948 -0219| -0.009 231) 1.098) 722| 0.924) -0.357 


|Geomr | 10.56% | -4.03%| 26.46% | 12.78%| 19.88%| -6.17%| 18.06%] -9.77%| -2.67% 15.38% | -4.95%| 42.69%) 18.43%] 21.75%) -10.51%] | | | 
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The “turn-of-the-year” effect is a well-documented stock market phenomenon in which low capitalization “small stocks” 
receive relatively higher returns than high capitalization “big stocks” on the last trading day of December and the first 8 
trading days of January. The difference in returns during this period is of the order of 10%. Strategies for buying and 
selling these small stocks may be profitable, but may also incur large transaction costs that eliminate most or all of the 
projected gains. In this paper, we show a preferable way to invest in order to exploit this anomaly: use a futures spread 
that is long in the small stocks and short in the big stocks. The optimal investment, which uses a modification of the 
capital growth criterion, is large and has a substantial expected gain with minimal risk. We have used this analysis 


successfully in managing investment accounts. 


his paper reviews the literature on the small-firm, 

turn-of-the-year effect and provides an invest- 
ment strategy for exploiting this empirical regularity. 
We begin by discussing the actual returns received 
from various investments over the last 60 years. The 
evidence indicates that small firms—that is, those with 
low capitalizations—greatly outperform other invest- 
ments. Most of the excess gains over larger capitali- 
zation firms occur in January, particularly in the first 
few days of the month. We examine this evidence and 
try to explain why this anomaly occurs and is so 
regular. (Indeed, it has been one of the most consistent 
of the stock market empirical inefficiencies.) 

William Ziemba’s interest in the facts and invest- 
ment potential of anomalies in the stock market arose 
in connection with a workshop he gave at the Los 
Angeles ORSA/TIMS meeting in April 1986 (Ziemba 
1986) and with his forthcoming book on market 
anomalies (Ziemba 1988). Ross Clark has been a 
commodity trader interested in technical and other 
speculative investment strategies. Together we have 
developed some decision rules on when and how 
much should be invested in the tum-of-the-year play, 
using index futures. We have used the strategy suc- 
cessfully in a number of private investment accounts. 


Subject classification: 197, 213 turn-of-the-year effect. 
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We first discuss the evidence for the turn-of-the- 
year effect, and then present our analysis of strategies 
for taking advantage of it. Finally, we describe our 
most recent experience in applying our preferred 
strategy. 


1. The Evidence 


Ibbotson Associates (1986) have considered the actual 
returns received from investments in United States 
assets with different levels of risk during the period 
1926-1985 (Figure 1). The results indicate that: 


* So-called “riskless investments”—T-bills, essen- 
tially—earn the rate of inflation, which averaged 
3.1% in those 60 years; their real rate of return is 
near zero, namely 0.3%: $1 in 1926 grew to $7.47 
by the end of 1985, with the price level at 6.10 (all 
rates of return are geometric annual averages). 

« Long-term government bonds perform similarly; 
they earn the rate of inflation plus 1% per year: $1 
grew to $11.03. 

+ Long-term corporate bonds have a rate of return of 
1.7% over inflation, or 4.8%: $1 in 1926 grew to 
$16.55 by the end of 1985. 
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Figure 1. Wealth indices for investments in the 
U.S. capital markets (1926-1985). Year-end 
1925 = 1.00. (Source: Ibbotson Associates.) 


+ Common stocks, as measured by the S&P 500 com- 
posite index, returned 6.7% plus inflation, or 9.8%, 
with 1926’s dollar being worth $279.12 in 1985. 

+ Small stocks, those New York Stock Exchange se- 
curities in the bottom fifth in capitalization, re- 
turned 9.5% plus inflation, or 12.6%; for these 
stocks, $1 grew to $1,241.24 in the 60 years. 


While small stocks outperformed common stocks by 
more than 4 to | in terms of cumulative wealth levels, 
their advantage has accrued only in the last two dec- 
ades, and most of the gains are in the 1974+ bull 
market. These returns are calculated on before-tax 
income, so that net return after taxes, adjusted for 
inflation for “riskless” investments and bonds, may 
well be negative for many investors. 

The difference in the total value of small-versus 
large-stock portfolio values is striking. Reinganum 
(1983a) computed the average stock price return by 
capitalization (Table I), with yearly rebalancing for 
the 18 years, 1963-1980, The results are overstated 
because Reinganum used daily data which has a bid- 
asked spread bias. 

Small stocks outperformed large stocks by more 
than 10 to 1 in this period. (Investors of small capi- 
talized stocks typically must sell at the bid and buy at 
the asked. Also, they move the market; see Stoll and 
Whaley 1983. Hence the average annual return of 
Portfolio 1 is probably overstated by 5-6%, so that 


the 10 to 1 edge, if properly measured, is more like 
4 to 1, which is less dramatic but still substantial.) 
During 1981-1986 (May), small stocks {the bottom 
two deciles) continued to outperform common stocks 
as a whole: they rose to $2.7737 per $1 by the end of 
1985 versus $2.3503 for common stocks (Ibbotson 
1986). Small stocks did better in 1981-1983. How- 
ever, common stocks, as measured by the S&P 500 
index, performed better in 1984-1987 (July). 

The higher returns for small stocks and for common 
stocks, as opposed to bonds or T-bills, were achieved 
by bearing greater risk. Table II shows Ibbotson’s data 
for the standard deviation of returns and the number 
of losing years during 1926-1985, 

Banz and Breen (1986) have pointed out that the 
Ibbotson data, as well as other data series based on 
the COMPUSTAT tape, suffer from at least two biases 
besides the bid-ask spread just mentioned. An ex-post- 
selection bias arises because the data base contains 
only companies that are currently viable, and excludes 
those that have merged, filed for bankruptcy or oth- 
erwise ceased to exist. Also, new companies enter the 
data base with a full history but without any data in 
the file at earlier dates. The look-ahead bias refers to 
the fact that data reported at particular times are 
usually not available to investors until dates in the 
next year. Banz and Breen have constructed a data 
base free of these biases. Their tests show a bias in the 
ordinary COMPUSTAT tape that favors small stocks. 
Hence the Ibbotson results are biased and slightly 
overstate the advantage of small stocks over medium- 
and high-capitalization stocks. The exact extent of the 
combined biases is not known. Note that tests based 
on CRSP data such as Reinganum’s do not suffer 
from these biases, 

Since 1981, the academic world has been fascinated 
by this small-firm effect and other anomalies. See 
Seligman (1983), Schwert (1983), Dimson (1986), 


Table I 
Average Stock Price Return by Capitalization 


Portfolio Decile edge Annes 
1 (smallest) +23.7 $452,800 
2 +17.9 185,000 
3 +18.5 201,600 
4 +16.2 139,500 
5 +15.2 127,900 
6 +15. 126,800 
7 +12.9 88,200 
8 +118 75,000 
9 +114 67,000 

10 (largest) +8.2 41,000 
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Table II 
Rates of Return and Their Risk (1926-1985) 
Number of 
Series Geometric Standard Losing 
Mean Deviation Years out of 
60 Years 
Small stocks 12.6% 36.0% 19 
Common stocks 9.8 21.2 19 
Long-term. 48 8.3 14 
corporate bonds 
Long-term govern- 4.1 8.2 16 
ment bonds 
U.S. Treasury bills 3.4 3.4 1 
Inflation 3.1 49 9 


Source: Ibbotson Associates. 


Keim (1986) and Ziemba (1986) for surveys of aca- 
demic research, and Lakonishok and Smidt (1987) for 
a 90-year look at the supposed anomalies. Rolf Banz 
(1981) and Marc Reinganum (1981), wrote the pi- 
oneering papers on the small-firm effect. Using a 
capital asset pricing model, they argued that even 
when the outcomes are adjusted for risk, small stock 
returns are higher. Brown, Kleidon and Marsh (1983) 
showed that the excess returns were linear in the log 
of size. The central question of interest to financial 
economists is whether the small-firm effect is a true 
anomaly or whether the extra gains are simply pay- 
ment for added risk once it is properly measured, 
since small firms tend to have higher market risk betas. 
Papers by Roll (1981) and Blume and Stambaugh 
(1983) have explored this question. Donald Keim 
(1983) made the important discovery that a large 
portion of the excess returns of small over large stocks 
occurs in January. He estimated that nearly 50% of 
the excess returns occur in this single month, of which 
26% are in the first week. Keim’s study suffers from 
the bid-asked spread bias which Blume and Stam- 
baugh correct for. Moreover, Blume and Stambaugh 
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Figure 2. Small stock portfolio minus S&P 500 
index (1926-1983). (Source: Ibbotson Associates.) 


argue that all of the size effect is in January, i.e., that 
the size effect is about 0.60% per day in January and 
zero the rest of the year. Tinic and West (1984), using 
data from 1935-1982 not controlled by size, showed 
that the positive relationship between risk and return 
is unique to January. The risk premiums in the re- 
maining months are not significantly different from 
zero. In further work on the same data, Ritter and 
Chopra (1987) found that the positive relationship 
exists only with small stocks. In essence, the pricing 
model is valid for small stocks only in January. De- 
spite these attempts to explain the small-firm effect 
with capital asset pricing and arbitrage pricing model 
tests, the academic world has not been able to find a 
defensible risk measure that brings the small-firm 
excess returns in January back into line. Gultekin and 
Gultekin (1983) show that the small firm effect occurs 
all over the world. 

In Figure 2, the Ibbotson data show that a small 
stock portfolio for the years 1926-1983 returned 
nearly 6% (arithmetic averages) more in January than 
the S&P 500 index. 


Table ILI 
Mean Return Differences between the 
Equally and Value-Weighted Indices 


January Trading Day during 
1963-1978 


December First Second Third Fourth 


Mean return 0.5647 1.186 0.6067 0.6107 0.4527 
t Statistic {4.72) (8.39) (3.86) (3.96) (3.05) 


Last Day 
of 


Moreover, Richard Roll (1983b), using data from 
1983 to 1978, found that most of the January gains 
(37% of the total, or 3.45%) are made on 5 days: the 
last trading day of December (the —1 day) and the 
first 4 trading days in January (+1 to +4), and much 
of the balance on the next 4-6 trading days. In total, 
67% of the 9.31% average annual return differential 
(biased upward slightly because of the bid-asked 
spread) for 1963-1980 between equally weighted and 
value-weighted indices of NYSE and AMEX stocks is 
attributable to the first 20 calendar days of January 
plus the last trading day in December. The mean 
return differences between the equally and value- 
weighted indices by trading day for the turn of the 
year days are very high and are all significant 
(Table IID). 

Jay Ritter (1986) has calculated that the average 
difference in returns between the smallest and largest 
deciles of market values on the New York Stock 
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Table IV 
Seasonal Behavior of Weekly Stock Market 
Absolute Risk Premiums (1963-1979): Rm — Rr 


Day Value-Weighted Equal-Weighted 


Relative to Market Returns Market Returns 
End of 

Year Rm- Ry t-Ratio Rm-R;  t-Ratio 

Rest of year —0.000679 —0.75 0.000223 0.22 

-12 0.003594 0.75 —0.001366 —0.25 

-Il —0.001362 -0.29 —0.001989 -0.36 

-10 —0.001967 —0.41 —0.009102 —-1.67 

-9 0.003528 0.74 0.001211 —0.22 

-8 —0.000377 -0.08 -0.001968 —0.36 

-7 0.003027 0.63 -0.000384 —0.07 

—6 —0.003648 -0.76 —0.004073 —0.75 

-5 0.008022 1.68 0.010486 1.92 

-4 —0.001206 -0.25 —0.002594 -0.48 

-3 0.000425 0.09 —0.000749 —0.14 

-2 0.001957 0.41 —0.000339 —0.06 

-I 0.006834 1.43 0.007857 1.44 

1 0.009289 1.95 0.030053* 5.51 

2 0.002526 0.53 0.020034* 3.67 

3 0.007946 1.67 0.015631* 2.87 

4 —0.000901 —0.19 0.006264 1.15 

5 0.002637 0.55 0.008836 1.62 

6 -0.000147 -0.03 0.002608 0.48 

7 0.001778 0,37 0.005080 0.93 

8 0.000694 0.15 0.001705 0.31 

9 —0.003851 —0.81 —0.002509 —0.46 

10 0.009044 1.90 0.009873 1.81 

11 0.005101 1.07 0.005775 1.06 

12 —0.004289 —0.90 -0.002252 -0.41 

13 0.004722 0.99 0.004235 0.78 
R-squared= 2.3% 7.4% 

Durbin-Watson statistic = 1.91 1.39 


* Significant at 5%, using a two-tailed test. 


Exchange is 9.99% (and ranges from 3.0% to 41.5%) 
for the nine trading days (—1 to +8) during the 
14 years 1971-1984. Table IV from Smidt and Stewart 
(1984) shows that risk premiums are much higher for 
small stocks than for large stocks on days (—1 to +4) 
and that they continue to be higher on days (+5 to 
+12), The highest returns, consistent with the Roll 
data and with Lakonishok and Smidt’s (1987) 90-year 
study, are on days (+1 to +3). 


2. Analysis of the Evidence 


Academic research on the turn-of-the-year effect has 
focused on three questions: 


+ What causes this anomaly? 

e Can its excess returns be explained by increases in 
risk? 

+ Can an investor make excess profits net of transac- 
tions costs by buying small stocks at the turn of the 
year? 


We now describe the current state of the conclusions 
reached so far for each question (for more details, see 
the cited references). 


Probable Causes 


The causes of the turn-of-the-year effect seem to be: 
(a) tax loss selling, (b) renewed buying interest in small 
stocks in the new year because of the availability of 
excess cash balances that, because of the low trading 
volumes, cause upward price pressure, (c) high trans- 
actions costs that prevent these patterns from being 
arbitraged away, (d) portfolio manipulations, and (e) 
turn-of-the-month and quarter price rise effects. We 
will look at the main points in turn. 


Tax Loss Selling. Branch (1977), Dyl (1977), Rein- 
ganum (1983b), Givoly and Ovadia (1983), Rozeff 
and Kinney (1976), Rozeff (1985a, b) and Wachtel 
(1942) have shown that securities with price declines 
in the previous year have high returns in January. 
These returns become even larger as the previous 
year’s decline increases. Williams (1986), using a ra- 
tional expectations model, shows that securities with 
heavy tax loss sales in December have larger compet- 
itive risk-premia and thus higher expected returns in 
January. Constantinides (1984, 1986) argues that, on 
logical grounds, though tax selling seems to be in- 
volved with the tumm-of-the-year effect, it cannot be 
the sole cause. Chan (1985) and DeBondt and Thaler 
(1985) found that the excess returns in January of 
stocks sold in December may last for as many as five 
years. Ritter observes that a higher percentage of small 
stocks are held by noninstitutional investors, who have 
a far greater incentive than institutions do to sell stocks 
that have declined in price. Losses can be deducted 
only following sales. Gains and losses realized prior to 
the last five trading days of the year affect that year’s 
taxes. Gains and losses realized in the last five days 
may be allocated to either year. Hence, during these 
last five days, the tax selling of losers is reinforced by 
the selling of winners by investors deferring the tax 
liability on the profits. Ritter, using transaction data 
from Merrill Lynch from mid-December 1970 to mid- 
December 1984, showed that in these 14 turn-of-the- 
year periods, the buy/sell ratios of individual investors 
are low for small stocks on days (—10 to —2) and high 
on days (+1 to +8). Fully 60% of the excess return on 
small stocks at the turn of the year is explained by 
buy/sell ratio changes. 


Renewed Buying Interest in Small Stocks in Janu- 
ary. Individual investors have available funds from 
tax loss sales, and locked-in profits from sales in 
the last five days, that appear on year-end account 
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statements, Christmas bonuses and other year-end 
financial receipts. They use these funds to buy stocks 
they consider underpriced, a high percentage of which 
are small stocks. Blume and Stambaugh, and Stoll 
and Whaley, have shown that the small firms are 
essentially represented by low-priced securities. 
Indeed, there is a significant positive relationship 
between mean excess return and the inverse of price- 
per-share. A portfolio of the lowest price stocks dom- 
inates a portfolio of the highest priced stocks by about 
10% per year during the 25 years from 1955 to 1979. 
Ken McNeil, a Calgary broker with Wood Gundy, 
mentioned that the action in January in low-priced 
stocks occurs when they hit key values such as $2, $5 
and $10, At that point, investors jump on the bargain 
bandwagon. 


Transactions Costs. Stoll and Whaley (1983) show 
that for the lowest-priced stocks, the total transaction 
costs average 2.93% bid-ask spread and 1.92% two 
way transaction, or nearly 5% for a round trip trade. 
Hence, although it seems advantageous to buy small- 
firm or low-priced stocks at the turn of the year 
because of the quick initial gains, virtually all of the 
excess gains would be eaten up with the transaction 
costs of a quick sale. Ritter estimates the (—1 to +8) 
gain of small stocks over big stocks to be 9.99% over 
the years 1971 to 1984. In addition, the gain would 
not be a capital gain. Stoll and Whaley find that 
the break-even holding period at which the after- 
transaction costs’ abnormal return is zero is about 
4 months for the smallest stocks. Partly because of 
these large transaction costs, we do not seem to ob- 
serve large amounts of this short-term trading. On 
average, these stocks are held and not sold in this 
period. 


Portfolio Manipulations. Positions and bonuses are 
largely based on performance. Hence, in late Decem- 
ber, portfolio managers may attempt to improve per- 
formance artificially by bidding up stocks already in 
their portfolios. This and other reasons lead to a shift 
to sales at the asking, from the bid on day (—1). For 
more on this phenomenon, see Lakonishok and Smidt 
(1984), Roll (1983a, b) and Ritter. 


Turn of the Month and Quarter Price Rises. Ariel 
(1987a, b), using data from 1963 to 1981, has shown 
that stock prices, on average, rise in the first half of 
each month and are flat during the second half; see 
Figure 3. This idea dates at least to Merrill (1966). 
Fosback (1976) has studied it and has discussed and 
successfully tested it, using no-load mutual funds and 
stock index futures, in his newsletter, Market Logic; 
see, for example, the May 1, 1987 issue. See also 
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Figure 3. The turn-of-the-month effect: 1963-1981. 
(Source: Ariel 1987.) 
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Hirsch (1979, 1986). The major effect is again on days 
(—1 to +4), with the strongest effect occurring during 
January. The effect on the first month of the other 
three quarters is larger than the other eight months. 
Lakonishok and Smidt (1987) give an extensive view 
of the effect over 90 years of data on the high capital- 
ization Dow Jones Index. While they find the effect 
Ariel reports for recent periods, it does not occur over 
the entire period. They treat (—1) as part of the pre- 
vious, rather than the succeeding, month. The turn- 
of-the-month effect seems to be attributable to several 
factors: end-of-month portfolio adjustments by insti- 
tutions, investment of monthly stock purchase plan 
receipts by mutual funds, and monthly salary and 
other receipts by the investing public. 


Excess Returns 


Can the excess returns be explained by increases in 
risk? The jury is out on this crucial question. However, 
some good insights appear in Rogalski and Tinic 
(1986). They utilized data from 1963 to 1982 on all 
New York Stock Exchange and AMEX securities, 
developing 20 equally weighted portfolios based on 
size. Portfolio 1 has the smallest stocks and 20, the 
largest. They reached the following conclusions. 


+ The hypothesis that the mean return for portfolio 1 
in every month is equal is false, but it cannot be 
rejected once January is eliminated. 

There is a seasonality in risk, as the following data 
show. 

The £ coefficients of portfolios 1-5 are much larger 
in January than in any other month when measured 
using daily data and the CRSP equally weighted 
index as the market return (8, the covariance of the 
portfolio, with the market portfolio divided by its 
variance, is the measure of the portfolio’s risk, con- 
sistent with the capital asset pricing model). 

+ The higher 8’s of small stocks in January are not 
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solely attributable to thinner trading volumes during 
that month. 

« The January £ of the smallest firms is almost twice 
that of the largest firms. 

» The January variance of portfolio 1 is nearly four 
times that of portfolio 20. 

+ The precise results are sensitive to the precise data 
set used, but the conclusions are robust. 


Excess Profits 


Can an investor make excess profits, net of transac- 
tions costs, by buying small stocks at the end of the 
year? 

The transactions costs are about 5% each way for 
small stocks and the potential gains are in the 10- 
15% range. Hence, there may be a small amount of 
gain left after a two-way transaction, but the gain is 
likely to be small. 


3. Strategies 


The subject of profits brings us to the thrust of this 
paper. There is strong evidence, as Figure 2 shows, 
that small stocks outperform large stocks at the turn 
of the year. Yet transactions costs eat away most, if 
not all, of the potential gains. Such costs on index 
futures are a tenth or less of the corresponding basket 
of securities. Hence, a strategy that should be very 
profitable is to hold long positions in a small stock 
index and short positions in large stock indices. 
Stock index futures began trading in 1982 in the 
United States, and the number of different contracts 
available, and their volume, have increased steadily. 
The ideal way to play the turn-of-the-year effect is to 
be long on the smallest stocks and short on the largest 
stocks in liquid index contracts. Our experience is 
with the spread between the Value Line and S&P 
indices, called the VL/S&P spread. This strategy is 
not an ideal way to play the effect, but it is the best 
we have found so far, and it has been successful. The 
VL index is an equally weighted geometric average of 
the prices of nearly 1,700 securities with futures traded 
on the Kansas City Board of Trade and futures options 
traded on the Philadelphia Stock Exchange. IBM and 
the smallest company in the index are treated equally 
in the weighting. The VL index has a downward drift 
of about 5.5% per year relative to the component 
securities in the index because of geometric averaging, 
due to the geometric-arithmetic averaging inequality. 
The amount of the downward drift depends upon the 
variance of price movements of the component secu- 
rities. The bias is approximately equal to half the 
average unique risk; see Modest and Sunderasan 


(1983). The higher the unique variances, the higher 
the drift. For more on the mathematics of the VL 
index, see Eytan and Harpaz (1986). The S&P 500 
futures contract is traded on the Chicago Mercantile 
Exchange, and is value weighted. Hence, IBM and the 
other large stocks count much more than the medium- 
size stocks at the bottom of the index. Hence, in a 
crude fashion, the VL/S&P spread gives you the small 
stocks long and the large stocks short. The bigger the 
stocks, the shorter they are. However, all the small 
stocks and medium stocks are held in the same pro- 
portion. 

The VL contains few of the smallest decile stocks 
on the NYSE, but it is the best index that is traded for 
our purposes. All of the other index futures traded in 
the United States are either value weighted or com- 
prise only a small number of large capitalization se- 
curities, such as the Major Market Index. For both 
the S&P and VL futures, four contract months are 
traded: March, June, September and December. The 
contracts currently expire on the third Friday of the 
contract month. (Until December 1985 the VL ex- 
pired on the last trading day of the third month.) This 
day is referred to as the triple witching day since 
futures, options and futures on options all expire at 
that time. On occasion this day has had tremendous 
volatility because program traders must unwind their 
positions before expiry. Beginning with the June 1987 
S&P contract, new rules went into effect: trading now 
ends on Thursday’s close but settlement is based on 
Friday’s opening prices. Whether or not this cumber- 
some procedure will work remains to be seen. The VL 
contract’s expiry remains at Friday’s close. 

For maximum liquidity and the smallest bid-ask 
spreads, especially since the VL/S&P spread trade 
must be made over two exchanges, it is best to use the 
March contract. Also, since the trade should be com- 
pleted at the end of January at the latest, there is no 
need to consider the June contract. The S&P contract 
is very liquid and trades about 75,000 contracts a day, 
but the VL is much less liquid, trading about 3,000 
contracts a day. Each spread entails two commissions 
{possibly computed as only 1.5 times a single futures 
contract commission), payable on sale, which cost 
from as much as $200 at a full-service broker down 
to $50 or less at a discount broker, Each contract 
is worth $500 times the index value. Hence, each 
point difference gains or loses the trader $500. Thus, 
0.05-0.40 is needed to cover the transactions costs. 
The margin requirements are about $1,100 per spread 
and are subject to change by the exchanges. (These 
margins will be much larger for the 1987/88 turn of 
the year because of the effects of the October 19, 1987 
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stock market crash.) Typically an investor keeps one 
or more $10,000 T-bills for this security plus a separate 
interest-bearing account to cover offside positions. 
The 1 — | weighting seems to minimize the risk of 
the overall position, since the standard deviation of 
the S&P and VL index futures are about 1.82% and 
1.74% per day with a daily correlation of 0.967, 
according to Donaldson, Lufkin and Jenrette’s Fu- 
tures Service. If the standard deviations and correla- 
tions change over time, the optimal spread may not 
bel —-1. 

When should an investor trade? We have found that 
the following rule works well: buy the spread on the 
first closing uptick, starting on December 15 and 
definitely by the 17th, and sell on January 15. Waiting 
until (—1) now seems to be too late: possibly the 
number of finance professors and their colleagues, as 
well as other students of the turn-of-the-year/January 
effect who are in on the strategy, move the VL index. 
There seems to be a bidding up of the March VL 
future price relative to the spot price. Table V shows 
the results from the ten years 1977-1986. By January 
15, the biggest gains are over and the risks increase. 
On average, the spread drops 0.92 points in this 
period, with a high variance. The projected gain from 
a successful trade is 0-5 points and averages 2.85 
points or $1,342.50 per spread, assuming a commis- 
sion of 1.5 x $55. On average, the December 15 to 
(—1) day gain on the spread is 0.57 points. However, 
it was 1.05 in 1985 and 3.15 in 1986, which may 
reflect the fact that with the thin trading in the VL 
index, the market can be moved with a reasonably 


small number of players who are learning about the 
success of this trade, i.e., the basis was bid up antici- 
pating the January move. We made this trade in 
1984/85, 1985/86 and 1986/87. The closing uptick 
rule sharpens the edge slightly. For example, in 
1985/86, the spread closed at 2.30 on December 17 
and we were able to buy it as low as 1.65 and 1.40, 
versus the 2.80 on December 15. The differential of 
the cash spread to the future spread has been of great 
assistance. As in most speculative markets, prices 
change after the market has exhausted the staying 
power of those players who make ill-timed decisions, 
and the cash/futures spread when employed as an 
oscillator (i.e., overbought/oversold) assists us in mar- 
ket entry timing around our time windows of Decem- 
ber 15-17 and January 15. Figures 4 and 5 graphically 
display the VL/S&P spread during December, for 
these ten years, One additional rule that would have 
added to profits in 1977/78, 1979/80, 1981/82, 
1982/83 and 1983/84 is to double up the position if 
the December 15 to (—1) period results in a loss for 
the trade. Table VI demonstrates the results of this 
modification: The profits from the trade triple from 
$993.80 to $2,724. 


5. The 1986/87 Play 


For our 1986/87 play we attempted to optimize our 
investment. The data in Table V yields an estimated 
mean return of +2.85 per contract. Dennis Capozza, 
a colleague at the University of British Columbia, 
estimated that the average standard deviation of the 


Table V 
Results from VL/S&P Spread on Various Buy/Sell Dates for the Ten Turn-of-the-Years (1977-1986)% 
Turn of the Spread Spread Spread Spread Difference Difference Difference < “Trade Gato Net Profit 
Year Dec.15 (1) Jan. 18 EndJan, «= Dees 15 (1) to Jan. Sto Dec. 150 Sn trade? 
i ij ü to(—1) Jan. 15 End Jan, Jan. 15 
Spot Prices 

1976/77 —14.74 -14.07 -10.92 —9.02 +0.67 +3.15 -1.90 +2.94 $1,387.50 
1977/78 -1.17 2.42 0.67 1.24 -1.25 +3.09 +0.57 +0.98 $407.50 
1978/79 0.79 4.53 3.44 5.40 +3.74 —=1.09 +1.96 uE pA $477.50 
1979/80 12.00 10.87 15.82 15.18 -1.13 4.95 0.64 +3.82 $1,827.50 
1980/81 9.16 11.77 11.22 11.24 +2.61 -0.55 +0.02 +2.06 $947.50 
1981/82 14.78 13.91 14.90 11.86 —0.87 +0.99 —3.04 +0.12 —$22.50 
1982/83 18.03 17.06 22.61 20.10 -0.97 +5.55 —2.51 +4.58 $2,207.50 

March Futures Prices 
1983/84 30.85 29.40 32.10 27.70 —1.43 +2,69 —4.40 +1.26 $547.50 
1984/85 11.80 12.85 16.60 19.50 +1.05 +3.75 +2.90 +4.80 $2,317.50 
1985/86 2.80 5.95 6.20 4.02 +3.15 +0.25 -2.18 +3.40 $1,617.50 
Average +0.57 +2.28 —0.92 +2.85 $1,342.50 


* The years 1974/75 and 1975/76 had extremely strong effects as well. 
t With 1.5 x $55. = $82.50 transactions cost. Lower transactions costs, in the $40-$50 range, are possible with discount brokers. 
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Figure 4(a-e). VL/S&P spread in December and 


January for 1976/77 to 1980/81. 


VL/S&P spread in 1986 was 3.00. This assessment 
yields the following approximate return distribution 
for the trade. 


Probability 
0.007 
0.024 
0.070 
0.146 
0.217 
0.229 
0.171 
0.091 
0.045 


Gain 
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For repeated investments over time, the capital 
growth criterion that involves the maximization of the 
expected logarithm of returns has many desirable 
properties: it maximizes the asymptotic rate of growth 
of the investor’s fortune; it minimizes the expected 
time to reach a preassigned (sufficiently large) goal; a 
period-by-period myopic optimization policy is opti- 
mal; it allows the investor to invest more as the 
situation becomes more favorable; it can take into 
account possible effects of the investor's purchases on 
the return distributions and thus allow for simple 
operational decision rules for actual applications; and 
it has the never-risk-ruin property of logarithmic util- 
ity. See e.g., Kelly (1956), Brieman (1961), Thorp 
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Figure 5(a-e). VL/S&P spread in December and 
January for 1981/82 to 1985/86. 


(1975), and Hakansson (1971, 1979). Ziemba and 
Hausch (1986, 1987) give a practical discussion of 
these concepts in layman’s terms, along with simula- 
tions. 

In practice, the capital growth, or Kelly criterion, as 
it is known in the speculative investment literature, 
tends to suggest extremely high wagers when the in- 
vestment situation is very favorable. Still, on balance, 
it seems to be the most desirable investment strategy 
concept for repeated speculative situations involving 
a small number of asset choices. 

For the turn-of-the-year play with a mean return of 
2.85 and standard deviation of 3.00, the optimal wager 
is a staggering 74% of one’s fortune! Indeed, in the 


long run, according to the theory, with such a strategy, 
the investor will, with probability approaching one, 
accumulate more money than investors with essen- 
tially different strategies. Still, with the uncertainty 
involved, lower wagers are suggested, especially if the 
distribution just described is overoptimistic. After all, 
the data in Table V have a sample size of only ten. 
Moreover, there are fluctuations, margin calls and the 
like to consider. MacLean, Ziemba and Blazenko 
(1987) have developed methods to generate a com- 
plete trade-off of risk versus return in such dynamic 
investment situations, using “fractional Kelly strate- 
gies.” The lower the fraction of the “optimal Kelly 
wager” invested, the higher is the security and vice 
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Table VI 
Results from VL/S&P Spread with the Double-Up Modification 
Two 
a A Net Profit 
Difference Times : 
Turn of the Year Tr Dierenees Trade Gain Net Front poets 
to) CD to Modification 
Jan. 15 
Spot Prices 
1977/78 1.25 6.18 4.93 $2,300 $407.50 
1979/80 —1.13 9.90 8.77 $4,220 $1,827.50 
1981/82 —0.87 1.98 Lil $390 —$22.50 
1982/83 —0.97 11.10 10.13 $4,900 $2,207.50 
March Futures Prices 
1983/84 =1.43 5.38 3.95 $1,810 $547.50 
Average =1.13 6.91 5.78 $2,724 $993.80 


versa. Figure 6 displays the probability of doubling, 
tripling and tenfolding one’s fortune before losing half 
of it, as well as the growth rate, for various fractional 
Kelly strategies. At fractional strategies of 25% or less, 
the probability of tenfolding one’s fortune before halv- 
ing it exceeds 90%, with a growth rate in excess of 
50% of the maximal growth rate. Figure 7 gives the 
probability of reaching the distant goal of $10 million 
before being ruined for Kelly, half-Kelly and quarter- 
Kelly strategies with wealth levels in the range of 
$0-$10 million. The results indicate that the quarter- 
Kelly strategy seems very safe with a 99+% chance of 
achieving this goal. 
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Figure 6. Relative growth rate and the probability 
of doubling, tripling and tenfolding before halving 
for various fractional Kelly strategies. 


We employed the concepts in a $100,000 specula- 
tive account for a client of CARI Ltd., a Canadian 
investment management company. We decided to 
purchase five VL/S&P spreads to approximate a 
slightly less than 25% fractional Kelly strategy. Watch- 
ing the market carefully, we bought these on Decem- 
ber 17, 1986 at a spread of —22.18, which was very 
close to the minimum price of the spread. On Decem- 
ber 15, the spread closed at —20.90, and on the 16th 
at —22. It increased in value to —18.15 at the end of 
the year (in a flat and declining stock market) for a 
gain of 3.85. In January the stock market took off in 
an impressive style, with ten consecutive up days. The 
spread continued to gain, and we cashed out at —16.47 
on January 14 for a total gain of 5.55 points per 
contract, or a total gain of $14,278.50 after transac- 
tions costs. Figure 8 displays the spread during Decem- 
ber 1986 and January 1987. The spread began to drop 
sharply on the 15th and the drop escalated into a rout 
of the small stocks in comparison with the big stocks 
in the S&P 500. At the end of January the index stood 
at —31.45. Our experience demonstrates that the trade 
must be handled carefully and that, more or less, the 
December 15th to January 15th period is the best time 
to trade. More and more players are moving the 
market in the December 15-31 period, and as in 
1985/86, most of the gains occurred during that 
period. 


6. Conclusions 


The turn-of-the-year effect offers many interesting 
avenues for exploration. We will decribe a few. 


+ The effect might be played using options, since the 
variances and $’s of small stocks are much larger 
than large stocks in January. 
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Figure 7. The probability of reaching $10 million 
before ruin for Kelly, half-Kelly and quarter-Kelly 
strategies. 


+ A play might be devised using the Toronto Stock 
Exchange (TSE) 300 index, since Tinic and Barone- 
Adesi (1986) found that excess returns in January 
are nearly 4% and returns are negative in the other 
months. See also Berges, McConnell and Schlar- 
baum (1984) and Tinic, Barone-Adesi and West 


. 


. 


(1988). In in addition, the new Canadian capital 
gains tax revisions of June 1987 may give added 
pressure to the TSE in December 1987 and 1988 
and thus improve the chances for and the extent of 
these two turn-of-the-year plays. 

The Japanese stock market is another possible area 
for investigation; Kato and Schallheim (1985) give 
evidence for a strong January effect over the period 
1952-1980. 

Another strategy might involve U.S. stock indices 
such as the NYSE and Major Market, and the 
discrepancies between spot and futures prices: see, 
for example, Cornell (1985), Cornell and French 
(1983a, b), Modest and Sundaresan (1983) and 
Figlewski (1983, 1985). 

A study might be undertaken to determine the effect 
of the new United States tax laws on the turn-of- 
the-year effect. 


* Better weighting and investing strategies might be 


devised using dynamic asset allocation strategies. 


e Proper tests and ways to devise data sets need to be 


developed to evaluate the real strengths and risks of 
these effects. 

Finally, some basic questions remain unanswered. 
How much of this “data snooping,” as Lakonishok 
and Smidt (1987) call it, is real? Is the small first 
effect a measured relationship or a real anomaly, 
and how much learning is going on to move the 
market toward efficiency? On this point, see also 
Merton (1986) and Black (1986). 
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Figure 8. VL/S&P spread in December 1986 and January 1987. 
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Arbitrage Strategies for 
Cross-Track Betting on Major 
Horse Races* 


I. Introduction 


Racetracks and securities markets have many 
characteristics in common. A difference, though, 
is their complexity; the racetrack is really a se- 
quence of markets that are relatively simple, 
short-lived, and, for the most part, independent. 
This ‘‘market-in-miniature’’ feature makes the 
racetrack attractive for tests of market effi- 
ciency, especially since, as Thaler and Ziemba 
(1988, p. 162) suggest, ‘‘one can argue that wa- 
gering markets have a better chance [than securi- 
ties markets] of being efficient because the condi- 
tions (quick, repeated feedback) are those which 
usually facilitate learning.” The many empirical 
racetrack studies support a weak form of ef- 
ficiency for some of the available wagers, while 
other types of wagers seem not to be efficient. 
These studies are reviewed in Section II. 

This article studies cross-track betting, a rela- 
tively new form of wagering. It allows bettors to 
wager at their track (a cross track) on a race 


* Without implicating them, we would like to thank Bruce 
Fauman and Fraser Rawlinson. Also, we greatly appreciate 
the data supplied by a number of U.S. racetracks, and we 
wish to thank Victor Lespinasse for suggesting the one-track 
model. 
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Cross-track betting per- 
mits bettors to place 
wagers at their local 
tracks on a race being 
run at another track. 
Since each track oper- 
ates a separate betting 
pool, the odds can vary 
across the tracks. The 
data suggest that the 
odds vary, and they of- 
ten vary dramatically, 
allowing arbitrage op- 
portunities. This article 
employs a risk-free ar- 
bitrage model to dem- 
onstrate the cross-track 
inefficiency and recom- 
mends an optimal capi- 
tal growth model for 
exploiting it. A simpler 
method is proposed for 
a single bettor at a sin- 
gle cross track. The re- 
sults indicate that these 
methods would have 
worked well in practice 
on a number of recent 
Triple Crown races. 
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TABLE 1 Home-Track and Cross-Track Betting, Kentucky Derby 

Home-Track No. Cross Home-Track Cross-Track 
Year Attendance Tracks Betting ($) Betting ($) 
1982 141,009 ihe 5,011,575 
1983 134,444 hes 5,546,977 Tae 
1984 126,453 24 5,420,787 13,521,146 
1985 108,573 32 5,770,074 14,474,555 
1986 123,819 56 6,165,119 19,776,332 
1987 130,532 73 6,362,673 20,829,236 
1988 137,694 93 7,427,389 24,449,058 


being run at another track (the home track). Since cross-track betting 
tends to be limited to major races, it gives the racing public an opportu- 
nity to bet on some of the world’s finest racehorses. This makes it very 
popular with the public. Cross-track wagering can lead to increased 
attendance and revenues at the cross tracks and add to the revenues of 
the home track through a fee (usually 5% of the handle) paid by the 
cross tracks. Thus, all the tracks can increase profits. ' 

Separate pools for each track means the payoffs at the various tracks 
can differ.? Due to the costs of arbitrage in this setting, market 
efficiency across the tracks would, for practical purposes, allow some 
differences across the various sets of track odds. Considerable differ- 
ences, however, would suggest the possibility of a market inefficiency. 
The data demonstrate that considerable differences do occur. For ex- 
ample, a $2.00 win ticket on Ferdinand, the winner of the 1986 Ken- 
tucky Derby, paid from $13.20 at Fairplex in Pomona, California, to 
$90.00 at Evangeline Downs in Lafayette, Louisiana.’ Obviously, bet- 
tors would have preferred their win bets on Ferdinand to be made at 


1. For an example of this effect, consider the home-track and cross-track betting on 
the Kentucky Derby (see table 1). The introduction in 1984 of cross-track betting on this 
race has greatly increased total Derby wagering. At the same time, cross-track betting 
seems to have had, at worst, only a minor effect on home-track wagering. The cross 
tracks’ revenues can increase also. For instance, Illinois set a one-day pari-mutuel 
record on Kentucky Derby Day, 1987. Cross-track betting on the Derby accounted for 
$1,326,239 of the $4,534,879 wagered that day in the state. Another example is Calder 
Race Course in Florida. They set all-time revenue and attendance records on Kentucky 
Derby Day, 1985. Attendance was 23,105, and $2,775,645 was wagered, $562,453 of it on 
the Derby. 

2. In some cases, all the wagers at the various tracks are summed. Then, on the basis 
of these summed values, identical payoffs are made at all the tracks. This is often called 
‘‘intertrack’’ wagering and typically the tracks are within one state. Intertrack wagering 
will not be considered here. 

3. These extreme payoffs are not just limited to the smaller tracks. Two large-track 
examples are Hollywood Park, where Ferdinand paid $16.80, and Woodbine Racetrack 
in Toronto where he paid $79.60. Alysheba, the 1987 Kentucky Derby winner, paid from 
$15.80 at Hollywood Park, California to $30.20 at Beulah Park, Ohio. Winning Colors, 
the 1988 derby winner, paid $7.40 at Pimlico Race Course, Maryland, and $10.40 at 
Beulah Park. 
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Evangeline Downs. The nature of the pari-mutuel betting system re- 
quires that if Ferdinand paid less at Fairplex than at Evangeline 
Downs, then another horse, were it to have won, would have paid 
more at Fairplex than at Evangeline Downs. Thus, if we are able both 
to learn the odds and place our bets at various tracks, it appears that 
significant arbitrage opportunities may exist. 

Section III develops a risk-free arbitrage model to demonstrate this 
cross-track inefficiency. The optimal capital growth model is studied in 
Section IV on the general cross-track problem and, in Section V, ona 
simpler one-track problem. These models are tested on data from sev- 
eral recent Triple Crown races. A final discussion is in Section VI. 


II. Efficiency of the Various Betting Markets 


Among the possible wagers at the track are the so-called straight wa- 
gers to win, place, and show. They pay off when one’s horse is at least 
first, second, or third, respectively. The ‘‘exotic’’ wagers include 
quinellas (requiring one to name the first two horses), exactors (requir- 
ing the first two horses in the correct order), trifectas (requiring the first 
three horses in the correct order), and daily doubles (requiring the 
winners of two consecutive races). Tracks have also extended the 
daily-double concept to picking the winners of three, four, six, and 
even nine consecutive races. These are very low-probability bets that 
can have tremendous payoffs, and they are very popular with the rac- 
ing public. Before exotic wagering was offered by the tracks, bettors 
could use parlays and other combinations of wagers to construct low- 
probability/high-payoff situations. Rosett (1965) analyzed these pos- 
sibilities and demonstrated that, except for extreme long shots, the 
bettors were rational in the sense that a simple bet would not be made if 
a parlay with the same probability of success had a greater return. 
Similarly, Ali (1973) showed that the return on a daily double is not 
significantly different from the return on what is an identical wager, the 
corresponding parlay of win bets. 

Unlike typical casino games, where the odds are fixed, the odds at 
the track are determined by the relative amounts the bettors wager on 
the horses and by the track’s transactions costs (the track’s take and 
breakage). Thus, ‘‘prices’’ are determined at the track much like they 
are in securities markets. Let 


n = the number of horses in a race; 
T = the number of tracks accepting wagers on the race; 
W; = the total amount bet by the public at track ¢ on horse i to 
win(@i=1,...,nand/=1,...,7); 
W' = 3, Wi = the win pool at track t; and 
Q' = track čs payback proportion (typically from .80 to .86). 


ll 
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Then the payoff per dollar bet to win on horse i at track ¢ is 
O'W'/W: if horse i wins, 0) 
l 0 otherwise. 


To determine the place payoff let P} be the amount bet to place on 
horse i at track ż¢ and let P' = £;P; be track t’s place pool. The payoff 
per dollar bet to place on horse i at track ¢ is 


(2) 


1 + (Q'P' — Pi — P)/(2P;) if the first two horses are i and j, 
l 0 if horse i is not first or second. 

Thus, the track keeps (1 — Q')P' and the place bets on i and j are 
repaid. The remainder, the losing bets minus the track take, is then 
split evenly between those who bet on i and those who bet on j. The 
share for i bettors is then divided on a per-dollar-bet basis. The place 
payoff on i does not depend on whether i was first or second but it does 
depend on which horse j was the other top finisher. In a similar fashion 
the payoff per dollar bet to show on horse i at track ¢ is 


1 + (Q'S' — Si — Si — S})/@BS!) if the first three horses are i, j, 
and k, (3) 
0 if horse i is not at least third. 


Here Si is the show bet on horse i by the public at track 7, and S’ = }$;S; 
is track t’s show pool. It is possible that (Q’S’ — S} — S; — Si)/3S}) is 
less than 0.05, or even negative. In these cases, called minus pools, the 
track usually agrees to pay $0.05 profit for each dollar wagered. Minus 
pools can also occur in the win and place markets, but are much less 
common. Equations (1)—(3) ignore breakage, the additional charge that 
results from the track rounding all payoffs down to the nearest 5 or 10 
cents cn the dollar.* 

If prices reflect all available information then a market is said to be 
efficient (see Fama 1970). There are two conclusions that can be drawn 
from the many studies of win market efficiency (see, for instance, Ali 
[1977]; and Snyder [1978]). First, the North American public underbets 
favorites and overbets longshots, and this bias appears across the 
many years that data have been collected and across all sizes of race- 
track betting pools.’ Second, despite its strength and stability, this bias 
is almost always less than the track take and thus it cannot be exploited 
to achieve positive profits. Figure 1 illustrates the favorite/longshot 


4. Breakage may seem like a relatively minor cost, but we (Ziemba and Hausch 1987) 
demonstrate that it can have a dramatic long-run effect on a bettor’s fortune. 

5. Busche and Hall (1988) demonstrate an opposite bias for Hong Kong bettors. Con- 
trary to the North American bettors, Hong Kong bettors tend to overbet favorites and 
underbet long shots—a bias that is consistent with risk aversion. 
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Fic. 1.—Expected return per dollar bet versus odds level: aggregation of 
studies involving more than 50,000 races. (Source. —Ziemba and Hausch 1986.) 


bias and shows that the track take (assumed to be 15.33%, as it is in 
California) is sufficiently large to preclude profits. The one profitable 
exception is extreme favorites at odds of 3-10 or less. However, they 
are relatively rare. Thus, if we define a market to be weakly efficient 
(see Fama 1970), if no one can devise a profitable trading rule based on 
historical price information, the win market is, for practical purposes, 
weakly efficient. 

Hausch, Ziemba, and Rubinstein (1981) tested the efficiency of the 
place-and-show markets. They assumed: 

ASSUMPTION 1. If q; is the probability that horse i wins, then the 
probability that jis first and j is second is q,q,/(1 — q;), and the probabil- 
ity that i is first, j is second, and & is third is 


aga — gM - qi — gi. 
(These formulas were developed and tested by Harville [1973].)® 


6. Henery (1981) and Stern (1987) show that these equations can be derived by as- 
sociating with each horse an independent exponential random variable with a scale 
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ASSUMPTION 2. The win market is efficient, so the win odds can be 
used to estimate q;. 

Using these two assumptions and equations (2) and (3), Hausch, 
Ziemba, and Rubinstein (1981) were able to identify horses that were 
underbet to place or show. The optimal capital growth model then 
determined the place-and-show wagers that maximized the expected 
rate of growth of one’s bankroll. Since the exact model is a com- 
plicated nonlinear optimization problem that is difficult to solve at the 
track, Hausch, Ziemba, and Rubinstein (1981) developed simple re- 
gression approximations with quite minimal data-entry requirements. 
Their empirical studies on two seasons of racing data indicated that 
significant returns on the order of 11% were possible in the place-and- 
show markets. We (Hausch and Ziemba 1985; Ziemba and Hausch 
1987) extended Hausch, Ziemba, and Rubinstein’s (1981) results to 
provide further evidence of the place-and-show inefficiency. We 
(Ziemba and Hausch 1986) and Asch and Quandt (1987) studied ineffi- 
ciency of the exotic markets. Asch, Malkiel, and Quandt (1984, 1986) 
investigated whether a drop in the odds late in the betting period might 
reflect inside information and thereby point to wagers that may have 
positive expected returns. Their results suggest that this is not the 
case, however. A more thorough literature survey is in Thaler and 
Ziemba (1988) and Hausch and Ziemba (1990). The latter also studies 
racing outside of North America. 


II. Inefficiency of the Win Market and the Risk-free Hedging Model 


The literature has demonstrated weak efficiency of the win market at a 
single track, despite a favorite/longshot bias. To test whether this weak 
efficiency is maintained across the win markets with cross-track bet- 
ting, data were collected on several recent Triple Crown races. Al- 
though cross-track betting is becoming more popular, it tends to be 
testricted to major races. The best known of these are the Triple 
Crown races: the Kentucky Derby at Churchill Downs on the first 
Saturday in May, the Preakness Stakes at Pimlico Race Course 2 
weeks later, and the Belmont Stakes at Belmont Park 3 weeks after 
that. 

A simple problem is considered first. Allowing only win betting, 
what is the minimum amount that our bettor must wager to ensure the 
return of $1.00 regardless of which horse wins the race? The solution to 
this problem is, for each horse, to identify the track that has it at the 
longest odds and bet just enough to receive $1.00 if it wins. The solu- 
tion involves no estimation of the horses’ win probabilities. If this 


parameter equal to the inverse of its win probability. Then, any ordering of the random 
variables is just the Harville formulas. Stern also develops alternative ordering formulas 
using gamma distributions that are more accurate but more complicated. 


Chapter 3: Arbitrage Strategies for Cross Track Betting 


105 


Arbitrage Strategies 67 
TABLE 2 Projected Win Payoffs, 1983 Preakness 
$ Amount of 
Highest Win Return Wager That 
Horse No. (on a $1 Bet) Track Will Return $1 
1 29.40 Louisiana Downs .0340 
2 12.70 Louisiana Downs .0787 
3 34.60 Los Alamitos 0289 
4 169.90 Hollywood 0059 
5 56.90 Louisiana Downs .0176 
6 5.70 Louisiana Downs 1754 
7 10.60 Pimlico 0943 
8 76.60 Louisiana Downs 0131 
9 116.10 Hollywood .0086 
10 2.20 Los Alamitos 4545 
11 40.60 Los Alamitos .0246 
Total 9356 


solution were employed at one track, then our bettor would have to pay 
1/Q dollars (an amount greater than $1.00) to ensure a return of $1.00. 
With the opportunity of betting at several tracks, each with a different 
set of odds, we may be able to lower this minimum amount to below 
$1.00. To see how this system works, consider the 1983 Preakness. The 
final win odds were collected from 11 tracks that allowed wagering on 
this race.’ The highest of the 11 win payoffs on horse number 1 was 
$29.40 per dollar wagered at Louisiana Downs. Thus, a win bet of 
$0.0340 there would have returned $1.00 when, in fact, horse number 
1, Deputed Testamony, won the race. This wager and those on the 
other horses are presented in table 2. 

Thus, by wagering $0.9356, our bettor is guaranteed $1.00 regardless 
of who wins the race. This is a certain profit of $0.0644 per $0.9356 
wagered, or a guaranteed return of 6.9% rate of return in a 2-minute 
race, 

Obviously, ‘‘risk-free’’ arbitrage is not possible. Implementing the 
system requires that the track odds be sent to a central decision maker 
several minutes before the end of betting, and during that time the odds 
can change. The system does, however, demonstrate the large dis- 
crepancies in betting across the tracks and shows how simple it can be 
to take advantage of them. Table 3 presents the results of applying the 
system to the data from other Triple Crown races. 

Three of the races had insufficient variance in the win odds across 


7. The number of outlets involved in cross-track betting has increased over the years. 
The 1985 Kentucky Derby had betting at 32 outlets, including New York City’s off-track 
betting (OTB). The 1985 Preakness and Belmont had 28 and 41 outlets, respectively. For 
1988, the number of outlets for these three races were, respectively, 93, 87, and 76. We 
requested the final win, place, and show data from the outlets that we knew had accepted 
wagers. Response rates tended to be low and, in some cases, the data were no longer 
available, having been stored for only a short period. All the data received were used in 
the analysis. 
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TABLE 3 Risk-free Hedging Model Results 
No. Horses No. 
Race in Race Tracks % Profit 
Preakness: 
1982 7 5 0 
1983 11 11 6.9 
1984 10 4 .0 
1985 11 7 2.5 
Belmont: 
1982 11 5 13.6 
1983 12 9 8.5 
1984 It 2 0 
1985 9 i 5.0 
Kentucky Derby: 
1984* 12 7 JA 
1985 12 6 10.1 
Average 4.7 


* Cross-track betting on the Kentucky Derby did not begin until 
1984. 


the tracks to allow a risk-free profit. This is not surprising for the 1984 
Belmont because we had data from only two tracks. Also, the results 
for the 1982 and 1984 Preakness are based on only five and four tracks, 
respectively. In these three races our bettor would obviously make no 
wagers. These three races giving a 0% return together with the other 
seven races have an average risk-free profit of 4.7%. This rate of profit 
is for the certain return of $1.00. As the required certain return in- 
creases, then the wagers begin to affect the odds and this rate of profit 
will decrease.® 


8. This ‘‘certain return’’ scheme can be extended to include place-and-show betting. 
Let Ri be the return on a $1.00 win bet on horse i at track t when i wins. Similarly, let Rj 
be the return on a $1.00 place bet on i at track ¢ when i and j are the top two finishers, and 
let Ri, be the return on a $1.00 show bet on i at track ¢ when i, j, and k are the top three 
finishers. Formulas.(1)-(3), corrected for breakage, determine Ri, Ri; and Ri. The deci- 
sion variables are amounts to wager to win, place, and show on horse i at track t and are 
represented, respectively, by w!, pf, and s/. There is a constraint for each ijk finish that 
requires a return of $1.00, should that finish occur. The following formulation ignores our 
effect on the odds and, while this effect is negligible for a return of $1.00, it should be 
included for larger required returns. The formulation that determines the minimum ex- 
penditure for a certain return of at least $1.00 is 


T n 
minimize > ` (wi + pi + si). 
t=1 i=i 
subject to 


T 
Ñ (Riwi + Rip! + Ripi + Rias! + Rias! + Risi) =1 for each i j, k, 


t=1 
wi, pi, si =0 foralli=1,...,nand¢=1,...,T7. 
This linear program has a large number of constraints; with n horses there are 
n(n—1)(n—2) possible ijk finishes and, consequently, n(n—1)(n—2) constraints. (The 


dual program has 37n constraints, however.) Interestingly, even with the possibility of 
place-and-show betting, betting only to win, as in Sec. III, may be optimal. The 1982 
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IV. The Optimal Capital Growth Model 


Section 11] demonstrated an inefficiency in cross-track betting. We 
now propose and test an “‘‘optimal’’ wagering strategy for this 
inefficiency. Bettors at the racetrack appear to have many different 
objectives and, therefore, employ many different wagering strategies. 
Two quite reasonable long-term objectives, however, are (1) to max- 
imize the expected rate of growth of one’s bankroll, and (2) to minimize 
the expected time to reach some specified large wealth level. Breiman 
(1961) proved that both of these objectives are asymptotically satisfied 
by maximizing, on a myopic period-by-period basis, the expected log- 
arithm of one’s final wealth. This approach is termed the optimal capi- 
tal growth model, and its theoretical justification for the logarithmic 
utility function has been well studied (see Ziemba and Vickson [1975] 
for references and a discussion of its assumptions and results). A simu- 
lation by us (Ziemba and Hausch 1986) suggests that this strategy not 
only performs well asymptotically but over a year of wagering it can 
also be expected to outperform other commonly used betting strate- 
gies. Other attractive features of the capital growth model are that 
one’s effect on the odds can be accounted for and bet size is monotone 
in wealth. On the negative side, though, the recommended bets can be 
very large. Indeed, the Arrow-Pratt absolute risk-aversion index is, 
wealth” ', which is close to zero for large wealth. 

Let wo be our bettor’s initial wealth and q; the probability that i is 
first, j is second, and k is third. Also, let Pj; = P; + Pjand Si, = Sj + $ 
+ S4. The optimal capital growth model is? 


n n 


n 
max > i? 5 dijk log | Wo 


t t ot j= j= = 
Wr S i=1 j=) k=l 
{ Pe a ji kžij 


[or F > vt) = (PE + pi + pi) 
— a 


Preakness (with 7 x 6 x 5 = 210 constraints) is one example of this. The reason for no 
place-or-show betting at optimum seems to be a coordination problem. Win bets return 
one and only one positive amount and these payoffs are mutually exclusive. Thus, a 
return of $1.00 can be efficiently guaranteed with win bets. Place-and-show bets, how- 
ever, tend to have many different possible payoffs, and collecting on them is not mutu- 
ally exclusive. These differences make it difficult to efficiently choose place-and-show 
bets over the win bets. 

9. It is very simple to include exotic wagering in this capital growth model. it was 
formulated to consider only win, place, and show betting, though, because that was the 
only available data for testing the model. 
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pi Pj ) 
x |[— m + | 
TaT Pi + p 


[o(s + st) — (Six + sit spt s)| 
4 


+ 
3 
( si si Sk 
Si + si Si t+ si Si + sh 
n n n 
- (Diwe+ > pe+ > si) 
t=1 t= {=l ’ 
ei {Ai LAP K 
subject to 
T n 
>. (we + pe + se) S Wo, 
t=1¢=1 
we = 0, pe 20,520, t=1,...,7,€ =1,...,2. 


This model requires estimates of the win probabilities for each horse. 
The efficiency studies of the win market have indicated that the pub- 
lic’s win odds, adjusted for the favorite/longshot bias, provide good 
estimates of these probabilities. However, with cross-track wagering 
we not only have a different set of win odds for each participating track 
but, as demonstrated by the Ferdinand example in Section I and the 
risk-free hedging model in Section III, these odds can vary consider- 
ably. Rather than take a weighted average of all the tracks’ odds to 
arrive at a set of win probabilities, we decided to use only the home 
track’s odds.'° This decision was based on a perceived informational 
advantage that the home-track public has over the bettors at the cross 
tracks, an advantage that results from several factors: (1) since these 
races were run near the end of the day’s racing, the home-track public 
had watched the jockeys perform in, perhaps, several races already, 
they had observed the condition of the track and possibly noted any 
track biases, and they saw the horses in the paddock and in the parade 
to post; (2) the home crowd knows better if their track tends to favor 
front-runners or late chargers; and (3) since the home track is usually a 
larger track that has many major races, its public is more likely to have 
seen some of these horses race earlier in the season. Further, the 
studies supporting win-market efficiency have all involved home-track 
odds. Thus, we assume win-market efficiency at the home track and, 


10. This decision was not the result of any analysis of the data. There are only a few 
cross-track races each year and the required data for each race are the final tote-board 
figures from several of the racetracks permitting this betting. These difficulties led to data 
on only 10 races being collected, much less than would be required for any analysis. 
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TABLE 4 1982 Preakness 

Horse Finish Pimlico’s Win Odds Win Probability 
1. Reinvested Sii 7.6-1 .090 

2. Cut Away 3d 41.6-1 O11 

3. Water Bank ves 12.0-1 056 

4. Bold Style wee 26.2-1 023 

5. Laser Light ene 5.3-1 125 

6. Linkage 2d 5-1 597 

7. Aloma’s Ruler Ist 6.9-1 098 


after adjusting for the favorite/longshot bias in figure 1, use the home- 
win odds to estimate win probabilities. Since this win-market efficiency 
is not necessarily maintained at the cross tracks, the model can then 
include possible win betting at the cross tracks. The 1982 Preakness 
Stakes will be used to illustrate this model. Table 4 lists the race’s 
entrants and their odds. 

Aloma’s Ruler’s 6.9-1 win odds meant that, when he won, each 
dollar wagered on him was returned with an additional $6.90 profit. 
Linkage was the crowd’s favorite. His 0.5-1 win odds meant that, had 
he won, he would have returned $1.50 per dollar wagered on him. After 
adjusting the win probabilities for the favorite/longshot bias, the proba- 
bilities were then normalized to sum to one. These probabilities and 
equation (1) adjusted for breakage allow the calculation of the expected 
return on an additional dollar bet to win for each horse i at each track ¢. 
Similarly, the Harville formulas in assumption 1 and equations (2) and 
(3), also adjusted for breakage, allow the calculation of the expected 
returns on place-and-show bets (see Hausch, Ziemba, and Rubinstein 
1981). We received final win, place, and show figures from four of the 
tracks that allowed betting on the 1982 Preakness: Golden Gate and 
Los Alamitos in California, Centennial in Colorado, and Penn National 
in Pennsylvania. The calculated expected returns on the various bets at 
the four tracks are presented in table 5. 

If a track has a payback proportion of, say, 0.85 then the average 
return on $1.00 bets, ignoring breakage, will be $0.85. With breakage, it 
will be somewhat less than $0.85. For individual horses the expected 
returns from table 5 vary from 0.181 (show bet on horse 2 at Golden 
Gate) to 1.336 (place bet on horse 6 at Los Alamitos). Of particular 
interest are the expected returns exceeding 1.00 since those indicate 
the wagers that have a positive expected profit. We restricted our 
attention to wagers with expected returns of at least 1.10. They are the 
highlighted expected returns in table 5. If the probability estimates of 
this model were exact, then this 1.10 cutoff could result in suboptimal 
wagering, particularly since diversification can even lead to the inclu- 
sion of wagers with negative expected profits. Hausch, Ziemba, and 


110 


Calendar Anomalies and Arbitrage 


72 Journal of Business 
TABLE 5 Expected Returns, Cross-Track Betting on 1982 Preakness 
Horse 
1 2 3 4 5 6 7 
Finish as 3 Sass ads snd 2 1 
Win probability .090 O11 .056 .023 125 597 .098 


Expected return on 
a $1 bet to win: 


Golden Gate 837.246 437 -708 -788 -955 1.000 
Centennial .900  .389 370 -570 .825 836 1.147* 
Los Alamitos -666 391 .347 TTT. .713 1.015 1.196* 
Penn National 954  .484 588 1.109* -650 896 755 


Expected return on 
a $1 bet to place: 


Golden Gate .749 193.349 582 669 1.149* .880 
Centennial .769 = 238.260 329 719 1.084 1.120* 
Los Alamitos 794.233, 277 556 778 1.336* .888 
Penn National .673 39939] 737 -586 1.101* 731 


Expected return on 
a $1 bet to show: 


Golden Gate 837 181 405 413 817 1.153* .996 
Centennial 747 197.340 252 .803 1.008 1.138* 
Los Alamitos .890  .200 .392 -341 1.180* 1.293* .873 
Penn National .710  .235 451 .388 .769 1.099 .793 


NoTtE.—ln this table, horses 2, 6, and 7 are Cut Away, Linkage, and Aloma’s Ruler, respectively. 
* Highlighted expected returns (returns of at least 1.10). 


Rubinstein (1981) found, however, that due to the approximations in 
the model it was more profitable to wager only if the expected return 
was well in excess of 1.00. For large tracks and races of the quality of 
these Triple Crown races, a minimum expected return of 1.10 is rea- 
sonable. 

The capital growth model was run on the data from these cross 
tracks. Table 6 presents the optimal portfolio of wagers assuming a 
1.10 expected return cutoff and an initial wealth of $2,500. This port- 
folio has wagers totaling $2,437 at all four tracks and its certainty 
equivalent can be calculated as $432. Since the first three finishers of 
this race were horses 7, 6, and 2, our bettor would collect on the win, 
place, and show bets on horse 7 and the place and show bets on horse 
6, for a return of $3,716.90 and a profit of $1,279.90. The payoffs in this 
table account for our bettor’s effect on the odds. For example, the 
actual payoff per $1.00 bet to show on horse 7 at Centennial was $2.70. 
However, our bettor’s $83 show bet at this track would have lowered 
this payoff to $2.50. The latter payoff was used to determine our bet- 
tor’s return. 

Table 5 supposes only $1.00 is wagered. Thus, even though one track 
may have a higher expected return to, say, show on some horse than 
another track has, it can be that our bettor will make show wagers at 


Chapter 3: Arbitrage Strategies for Cross Track Betting 


Arbitrage Strategies 73 
TABLE 6 Optimal Capital Growth Wagers, Cross-Track Betting on 1982 
Preakness 
Expected $ Payoff $ Total 

Horse Bet Track Return $ Bet per $1 Bet Return $ Profit 
4 Win P.N. 1.109 14 ane eae ~ 14.00 
7 Win L.A, 1.196 40 12.00 480.00 440.00 
6 Place L.A. 1.336 855 1.50 1282.50 427.50 
7 Place Cen. 1.120 46 3.30 151.80 105.80 
5 Show L.A. 1.180 172 ERR si — 172.00 
6 Show G.G. 1.153 571 1.30 742.30 171.30 
6 Show L.A. 1.293 656 1.30 852.80 196.80 
7 Show Cen. 1.138 83 2.50 207.50 124.50 

Totals 2,437 3,716.90 1,279.90 


Note.—P.N. = Penn National; L.A. = Los Alamitos; Cen. = Centennial; G.G. = Golden Gate. 


both tracks because of the bettor’s effect on the odds. This happened at 
Golden Gate and Los Alamitos. Horse 6 had expected returns of 1.153 
and 1.293 at these tracks and the optimal show bets were $571 and $656 
at them, respectively. With these wagers, it must be that the addition to 
expected utility from an additional dollar bet to show will be the same 
at the two tracks.!! 

The capital growth model was studied with cross-track data on the 
other Triple Crown races. In each case it was assumed that the initial 
wealth was $2,500. The results of these races and the 1982 Preakness 
are given in table 7. 

Despite losses on two of the races, including a huge loss on the 1984 
Preakness, there was a total profit of $2,647.80 for a 15% return on 
money wagered. Average profits were $264.78. The standard error of 
the mean is $342, so no statistically significant statements can be made 
about positive profits on the basis of these results. Studying the ex- 
pected returns suggests that many of the profitable overlays occur 
because of regional biases. For instance, Conquistador Cielo, the win- 
ner of the 1982 Belmont Stakes, raced his entire career on the East 
Coast. The West Coast bettors were less familiar with him, while on 
the East Coast he was, for many, a sentimental favorite. In the Bel- 
mont Stakes he tended to be sent off at lower odds at the East Coast 
tracks. When he won he paid from $5.80 to win at Commodore Downs 
in Pennsylvania to $15.40 at Los Alamitos in California. Another ex- 
ample is Tolomeo, the English colt that won the 1983 Arlington Mil- 
lion. His odds were 4-1 in England and 38-1 at Arlington Park in 


11. This marginal utility can be complicated because a component of the expected 
return on an additional dollar bet to show on horse 6 at Los Alamitos is the positive effect 
that it has on the return to the show bet on horse 5 at Los Alamitos if 6 finishes out of the 
money. 


TABLE 7 Results of Optimal Capital Growth Model on Triple Crown Races 
Total Certainty 
No. Horses Wagers Equivalent No. Wagers Return Profit 
Race in Race No. Tracks No. Wagers ($) ($) Won ($) ($) 
Preakness: 
1982 7 4 8 2,437 432 6 3,716.90 1,279.90 
1983 11 8 13 1,949 325 1 1,647.30 — 301.70 
1984 10 3 8 2,282 371 0 0.00 — 2,282.00 
1985 11 6 11 1,817 325 4 2,014.40 197.40 
Belmont: 
1982 I} 4 20 942 305 5 1,880.40 938.40 
1983 12 7 20 2,452 469 6 2,578.80 126.80 
1984 11 2 3 1,371 317 1 2,331.00 960.00 
1985 9 9 15 471 73 1 78.30 — 392.70 
Kentucky Derby: 
1984 12 6 13 2,027 340 5 3,027.20 1,000.20 
1985 12 6 7 1,973 350 3 3,094.50 1,121.50 
Totals 118 17,721 3,307 32 20,368.80 2,647.80 
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Illinois. This observation and table 7 suggest that a high expected 
return is possible even with only a few carefully selected cross tracks. 

The Triple Crown races are the most widely publicized of the North 
American horse races. Thus, one might expect that other less pub- 
licized cross-track races would have even more divergence in the odds 
across the tracks. If so, they should have an even greater potential for 
expected profits. 

A major assumption of this model is that after all other bettors have 
made their wagers our bettor (1) learns the tote-board information at 
each track, (2) runs the capital growth model, and (3) communicates 
the optimal wagers to agents at each track, who then make the wagers. 
Obviously this is extremely difficult, if not impossible, in practice, 
never mind any legal concerns. Just learning the tote-board informa- 
tion at each track is difficult because there are no pay phones inside the 
racetrack grounds. A central decision maker and the agents communi- 
cating with cellular phones is feasible but still requires a significant 
amount of time. Unfortunately, though, odds may change in the last 
few minutes of betting, and profitable bets a few minutes before the end 
of betting may not be profitable based on the final odds. Hausch, 
Ziemba, and Rubinstein (1981) studied the odds changes in the last 2 
minutes of betting and found that expected returns did change some- 
what but profitable place-and-show bets 2 minutes from the end tended 
to remain profitable based on final odds. However, the agents in this 
model, because of its extra complications, would probably have to 
report odds more premature than those 2 minutes before the end of 
betting. The results given in this article, then, may overestimate the 
profits possible in practice. Additionally, our profit figures do not ac- 
count for the costs of implementing this procedure—the agents and 
other costs at each cross track, the long-distance phone calls, and the 
computer time. 


V. Testing the One-Track Capital Growth Model 


Implementing the previous section’s capital growth model is certainly 
difficult. However, when the race is televised from the home track 
there are simpler versions of this scheme possible for one bettor at one 
track. Our bettor, with a portable television at the cross track, can 
view the home odds when they are shown on TV. With these odds 
giving “true” win probabilities, our bettor can search for overlays at 
the cross track. A flat bet to win could be made on horses going off at 
longer odds than at the home track, or a sophisticated bettor could also 
bring a portable computer to the track and run the capital growth model 
described in Section IV for one track, that is, T = 1. This latter scheme 
is tested here with the Triple Crown race data. 

As an example, consider the 1984 Kentucky Derby simulcast at 
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TABLE 8 Optimal Wagers Based on One-Track Capital Growth Model, 1984 
Kentucky Derby 
Wager Expected $ Payoff on $ Realized 
Type Horse Return $ Optimal Bet a $1 Bet Return $ Profit 
Win 2 1.236 15 sae ae ~ 15.00 
Win 10 1.159 ae 5.60 ates 
Place 2 1.153 aan oe Sau aor 
Place 10 1.410 249 3.70 921.30 672.30 
Show 2 1.353 261 wee hs — 261.00 
Show 10 1.275 295 2.50 737.50 442.50 
Totals 820 1,658.80 838.80 


Golden Gate Fields in Albany, California. Using the win odds televised 
from Churchill Downs, Golden Gate had six wagers with expected 
returns exceeding 1.10. With a wealth of $2,500, the capital growth 
model yields the optimal wagers shown in table 8. 

This portfolio has four wagers totaling $820. The win bet on horse 
number 10 and the place bet on horse number 2 are zero even though 
they have expected returns exceeding 1.10. This is because the possi- 
bility of number 2 doing well in the race is better accounted for with the 
higher returning win-and-show bets on him. Also, the possibility of 
number 10 doing well is better accounted for with the higher returning 
place-and-show bets on him. Swale, number 10, won the 1984 Derby, 
followed by Coax Me Chad and At The Threshold for a 10-12-9 finish. 
Therefore, only the place-and-show bets on Swale paid off for a return 
of $1,658.80 and a profit of $838.80. 

Table 9 presents the results of this one-track model on the other 
races. The average wagers on a race varied from $78.33 to $1,173.25 
and the average profits varied from — $868.67 to $824.10. The average 
of these 10 average profits was $69.97 or 9.2% on the money wagered. 
Again, there is such variability in the profits that, without additional 
data, no statistically significant statements can be made about positive 
expected profits. 


VI. Final Discussion 


There is considerable evidence that the win market at the racetrack is 
weakly efficient. A risk-free arbitrage model is presented that demon- 
strates that this is not the case with cross-track wagering. The mispric- 
ing by the public at the cross tracks may be due partly to their more 
limited access to information relative to those attending the home 
track. Also, some of the variance in the odds across the tracks may be 
due to certain horses being more familiar to bettors in certain regions of 
North America. The optimal capital growth model suggests that the 
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TABLE 9 Results of One-Track Capital Growth Model on Triple Crown Races 
Average Average Average 
No. Cross-Track Cross-Track Cross-Track 
of Cross Wager Realized Return Profit 
Race Tracks ($) ($) ($) 
Preakness: 
1982 4 1,173.25 1,724.80 551.55 
1983 8 909.25 353.79 — 555.46 
1984 3 868.67 .00 — 868.67 
1985 6 799.67 789.37 — 10.30 
Belmont: 
1982 4 407.25 639.02 231.78 
1983 7 532.71 488.63 — 44.08 
1984 2 1,158.00 1,982.10 824.10 
1985 9 78.33 7.80 — 70.53 
Kentucky Derby: 
1984 6 520.67 884.92 364.25 
1985 6 1,161.33 1,438.42 277.08 
Average 5.5 760.91 830.88 69.97 


discrepancies across the tracks can allow profits, but further work is 
needed to demonstrate significant profits. A simpler version of the 
optimal capital growth model for one bettor at one cross track also 
demonstrates the possibility of profit. 
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The folklore of investment is replete with stories of arbitrage 
opportunities where profits can be made without risk. Such a 
“lock” exists at the racetrack. A simple model provides a cri- 
terion for existence of a set of bets to create the arbitrage plus 
the size of the various investments. 


Ro return only about 82 per- 
cent of the money wagered to the 


winning bettors. Thus, the average bettor 
loses about 18 percent of his or her wag- 
ers. There are two parts to this loss: (1) 
the track take is the predetermined per- 
centage of the betting pool that is kept by 
the track; and (2) breakage is the process 
of rounding down the payoffs on winning 
bets to common payoff amounts. Handi- 
cappers have proposed many systems for 
beating the track but an 18 percent disad- 
vantage is a formidable hurdle to over- 
come. Hausch, Ziemba, and Rubinstein 
[1981] and Hausch and Ziemba [1985] de- 
vised a method that allows the average 
player to win by utilizing the betting bias 


423 


of the players along with an investment- 
decision mode! to determine when and 
how much to wager. The method works 
well, but it is far from risk free. This is 
also true of alternative approaches that 
are discussed by Asch, Malkiel and 
Quandt [1984, 1986], Thaler and Ziemba 
[1988] and Hausch and Ziemba {forth- 
coming a]. However, there are situations 
at the track where it is possible to con- 
struct a risk-free hedge, or in the racing 
vernacular, a lock. The only publications 
we know of that concern locks are Willis 
[1964], Leong and Lim [1989], and 
Hausch and Ziemba [forthcoming b]. The 
latter two deal with cross-track betting 
where a number of tracks each offer 
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odds, possibly very different, on the 
same race. While it may be possible to 
make risk-free profits with cross-track 
betting, it involves considerable overhead 
at the various locations, and the bettors 
may encounter legal problems with com- 
municating the information. Willis [1964] 
discusses a much cleaner situation with 
all activity concentrated on one race at 
one track. When the public's wagers in 
the win market (one collects if one’s 
horse is first) and the place market (one 
collects if one’s horse is first or second) 
are very different, Willis shows how arbi- 
trage between the two markets may be 
possible. However, it would be extremely 
rare for the odds in the two markets to 
vary sufficiently. In fact, Willis supplies 
no actual examples. Our lock concentrates 
on another market, the show market. 

One collects on a show bet if the horse 
finishes first, second, or third. Let S, rep- 
resent the public’s show bet on horse i 
and let Q be the track’s payback propor- 
tion (about 0.80 to 0.86). Finally, let n be 
the number of horses and let S = 2 S; be 
the show pool. Then if horses i, i “and k 
finish 1-2-3 in any order, the payoff per 
dollar wagered on i is 
14 QS - §,- S - Sk 

35S; 
The ticket holders of the first three horses 
are repaid their original bets plus a share 
of the profits. In practice, breakage 
rounds this payoff down to the nearest 5 
cents or, more commonly, 10 cents. With 
INT[Y] giving the largest integer not ex- 
ceeding Y, the formula for the payoff in- 
ee the effect of breakage is 
INT Í NIQS — S; - S,- S4 
3S; 


(1) 


1 2 


1+4 
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10 or 20 for 10 cents or 5 
cents breakage, respectively. If this payoff 
is less than $1.05 for each dollar wagered, 
then most tracks guarantee a return of 


where N = 


$1.05, that is a five percent profit. 
Situations where the track must honor 
this guarantee of a five percent minimum 
payoff are called minus pools. They result 
in the track collecting less than (1—Q)S, 
and hence, tracks try to avoid them 
whenever possible. However, they are not 


Our lock concentrates on the 
show market. 


uncommon. We focus here on a particular 
type of minus pool, one where a heavy 
favorite has about 95 percent of the show 
pool bet on it, that is where S, = 0.955. 
The crowd figures that this horse, or 
group of horses if it is a betting entry, is 
so good that it almost surely will finish at 
least third. In such a situation, one can 
construct a lock in the show pool. 

Locks were described in articles in 
Sports Illustrated [Gelband 1979] and For- 
tune [Seligman 1979] using the example in 
Table 1, the Alabama Stakes at Saratoga 
on August 11, 1979. 

Davona Dale’s 95.5 percent share of the 
show pool is unusually high and has cre- 
ated a minus pool. If Davona Dale fin- 
ishes in the money, that is, at least third, 
then the show payoffs on the first three 
finishers will be the minimum $1.05 per 
dollar bet. Here is how a lock can be de- 
vised: If Davona Dale is in the money, 
then we will receive five percent on the 
money we wagered on her plus five per- 
cent on the money we wagered on the 
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Win Odds Win Show % of 

Horse to $1 Probability Bet Show Pool 
Davona Dale 0.30 0.661 $435,825 95.5 
It's in the Air 3.10 0.210 7,901 1.7 
Mairzy Doates 13.40 0.050 4,518 1.0 
Poppycock 17.50 0.037 4,417 1.0 
Croquis 15.40 0.042 3,873 0.8 
1.000 $456,534 100.0 


Table 1: The public's win odds and show bets are given for the 1979 Alabama Stakes at 
Saratoga. The conversion of win odds to win probabilities accounts for the public’s biases 
(see Ziemba and Hausch [1986] and Hausch and Ziemba [forthcoming b]). 


other two horses that finish in the money. 
If this adds up to more than the wagers 
we lost on the fourth and fifth place 
horses, then we are ahead. If, as well, the 
amounts we wagered on the four long- 
shots are such that, if Davona Dale fin- 
ishes out of the money, our return covers 
both the bet on her and on the other out- 
of-the-money horse, then we have a profit 
regardless of the outcome of the race. A 
lock is clearly a very conservative betting 
strategy; it is consistent with a utility 
function that has infinite disutility for any 
losses. Later we will discuss the logarith- 
mic utility function for comparison, but 
for now we will describe the conditions 
that are necessary for a lock and recom- 
mend wagers on the horses to develop a 
lock. 

Suppose we wish to receive approxi- 
mately the same profit regardless of the 
finish. Initially, let us assume that (1) ex- 
cept for the favorite, the public wagers 
the same amount on each horse to show, 
and (2) our bets do not affect the odds. 
With k as the fraction of the show pool 
on the favorite, the show bet on the favor- 
ite is kS and the equal show bets on the 
other horses are (1—k)S/(n—1). Let x be 
our wager on the favorite and y be our 


show bet on each of the other horses, for 
a total wager of x+ (n — 1)y. If the favorite 
is in the money then we collect five per- 
cent on x and two of the y bets, but lose 
(n—3)y on the n—3 losers, for a profit of 
.05(x + 2y) —(n— 3)y. If the favorite is out 
of the money, then, since it has been sup- 
posed that our wagers do not affect the 
odds, our profit is 


Os: = 3(1 —k)S 
3 ee y 
3 (1 ~kyS/(n — 1) 
— x — (n-4)y. 


The first term is the profit on the three 
horses that finished in the money, and it 
follows from expression (1). The next two 
terms are the losses on the favorite and 
on the —4 other horses. 

To guarantee a particular return regard- 
less of the finish, these two profit func- 
tions must be equal. This means x and y 
must satisfy the following ratio: 
xly = —2 + Q(n—1)/{1.05(1 —k)]. (3) 
This x/y ratio yields profit of .0S5yQ(m— 1)/ 
[1.05(1 —k)] - (n—3)y, and this must be 
positive for a lock to exist. This holds 
when 
k>1 - Q(n-1)/[21(n—3)). 

Our example does not satisfy the first 


(4) 
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assumption: the bets on the long shots are 
not equal; they range from $3,873 to 
$7,901. The second assumption will hold 
only if our wagers are relatively small. 
Despite this, Q=.85 and n=5 for our ex- 
ample, so condition (4) indicates that a 
lock will exist if k exceeds 0.920. Davona 
Dale’s k=0.955 so we can devise a lock. 
Equation 3 gives x/y = 69.96, so with 
$2,500 to bet we would wager x= $2,364 
on the favorite and y= $34 on each of the 
other four horses. 

If Davona Dale finished in the money, 
then our profit would be $53.60. If the 
public’s show bets on the other four 
horses had been the same, then the profit 
would also have been $53.60 even if 
Davona Dale finished fourth or fifth: a 
guaranteed 2.1 percent return. The fact 
that the show bets on the four other 
horses are different will not affect our 
profit of $53.60 if Davona Dale finished at 
least third. If Davona Dale finished fourth 
or fifth, however, our profit would de- 
pend on which of the four horses were 
the top three finishers. Table 2 lists the 
profit for the four possible permutations 
of these four horses being the top three 
finishers. If the four profits are weighted 
by their likelihoods, the average profit is 
about $53.60. 

In this analysis, we supposed that our 
bets do not effect the payoffs. For this ex- 
ample and a total wager of $2,500, this 


Top Three Finishers (order does not matter) 


It’s in the Air — Mairzy Doates — Poppycock 
It’s in the Air — Mairzy Doates — Croquis 
It’s in the Air — Poppycock — Croquis 
Mairzy Doates — Poppycock — Croquis 
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assumption has been fine. In reality, the 
effect of our wages on the payoffs would 
reduce each of the profits in Table 2 by 
less than $7.00 and have no effect on our 
$53.60 profit if Davona Dale was in the 
money. The effect on the payoffs of a 
much larger total wager is more serious 
though. For example, betting a total of 
$25,000 cannot be expected to allow a 
guaranteed profit of about $536.00. 

Equation 4 shows that the condition for 
a lock is more easily met when n is small 
or Q is large or both. Unlike most states 
which have a guaranteed minimum five 
percent return, Louisiana has a minimum 
return of 10 percent. Equation 4, revised 
for Louisiana, yields the less restrictive 
lock condition, k > 1 — Q(n-—1)/ 

[11(n —3)]. If Q = .85 and n = 5, as they 
do in our example, then k = .846 is suffi- 
cient for a lock. 

Conditions (3) and (4) assumed that the 
public wagers the same amount on each 
of the nonfavorite horses. It is possible to 
treat the case where these amounts are 
different, as they are in this Alabama 
Stakes example. Let x continue to be our 
wager on the favorite, Davona Dale, but 
now let y, Yz} y3, and y, be our wagers on 
It’s in the Air, Mairzy Doates, Poppycock, 
and Croquis, respectively. Let R be our 
guaranteed return. Table 3 shows the re- 
turn on each horse for each possible tri- 
plet of winners. The linear program for 


Profit 


$ 12.60 
148.60 
172.40 
597.40 


Table 2: If Davona Dale finishes out of the money, then profit depends on the identity of the 
top three finishers. The order in which they finish does not matter, though. 
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maximizing the guaranteed return to a 
$2,500 bankroll is 

Maximize R 

subject to 

x+ yı + y + y; + y, = 2500 


O5(x + y, + y) — 4 ~Y ZR 
O5(x + y, + ys) — y- yR 
O5(x + y, + Ys) -yY YVR 
O5(x + y + y) >y — Ww ZR 
O5(X + y + y) —Y¥r yR 
O5(x + y + Ys) — ¥2 — ¥2 ZR 


15.60 y, + 27.30 y, + 28.00 y; — x - y; 
2R 

15.60 y, + 27.40 y, + 31.90 y, — x - y; 
2R 

15.60 y, + 28.00 y; + 32.00 y - x ~ 4: 
=R 

27.60 y, + 28.30 y; + 32.20y, —- x — y 
2R 

X, Yir Yr Yar Ya = O. 

The solution is R* = $52.47 with x* = 

$2366.04, y* = $34.54 for i=1,2,3, and y,* 
= $30.34. So, the wagers on the horses 

are close to those made in the equal- 

public-wagers case. Constraints 2, 3, and 

5 have surpluses of $4.41 while con- 

straints 10 and 11 have surpluses of $23.75 

and $454.66, respectively. The remaining 

constraints are binding. 

The lock condition is seldom met, and 
to get a substantial return one needs a 
large bankroll. However, the times when 
it is met should not be complete sur- 
prises; it happens when there is an ex- 
treme favorite that the crowd figures 
cannot be out of the money. Davona Dale 
allowed another lock in the 1979 Coaching 
Club American Oaks at Belmont. She 
won that race with 97.8 percent of the 
show pool on her! Two years after Davona 
Dale won that race, the entry of Heavenly 
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Cause and De La Rose had 95.2 percent 
of the 1981 Coaching Club American 
Oaks show pool. Spectacular Bid ran 30 
races and finished out of the money only 
once, and that was as a two-year-old. He 
was such a standout that he often went 
off at 1-20 odds, a good sign that a lock 
may exist. One lock on him was the 1980 
Amory Haskell Handicap where he had 
96.0 percent of the show pool. A more re- 
cent example is Easy Goer’s impressive 
win in the 1989 Gotham Stakes at Bel- 
mont Park. Easy Goer had 97.1 percent of 
the show pool of $553,658. We thank 
Peter Arnold for bringing this lock to our 
attention. 

If a horse that has virtually all the 
show money bet on it loses, the payoffs 
can be very high. In fact, show payoffs 


The show payoff on Arbor 
Hoggart is thought to be the 
highest show payoff of any 
sort. 


can exceed win payoffs. This happened 
when the public’s favorite, Kassa Branca, 
finished last in the New Jersey Futurity at 
Freehold Racetrack on October 22, 1988. 
Two bettors had wagered $50,000 and 
$150,000 to show on him. Others brought 
the show bet on Kassa Branca up to 
$213,837. That amounted to 98.8 percent 
of the show pool of $216,492. The payoffs 
per two-dollar wager were 


WIN PLACE SHOW 

Beta Bob 41.20 11.20 312.40 
Nukes Image 7.00 113.00 
340.80 


Arbor Hoggart 
The show payoff on Arbor Hoggart is. 


thought to be the highest show payoff of 
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Probability of Respective 
Top three horses these being the show payoffs 
top three horses per $1 bet 
Davona Dale-It’s in the Air-Mairzy Doates 0.34468 1.05, 1.05, 1.05 
Davona Dale-It’s in the Air-Poppycock 0.25139 1.05, £.05, 1.05 
Davona Dale-It’s in the Air-Croquis 0.28692 1.05, 1.05, 1.05 
Davona Dale-Mairzy Doates-Poppycock 0.03589 1.05, 1.05, 1.05 
Davona Dale-Mairzy Doates-Croquis 0.04109 1.05, 1.05, 1.05 
Davona Dale-Poppycock-Croquis 0.02970 1.05, 1.05, 1.05 
It's in the Air-Mairzy Doates-Poppycock 0.00330 16.60,28.30,29.00 
It's in the Air-Mairzy Doates-Croquis 0.00376 16.60,28.40,32.90 
It’s in the Air-Poppycock-Croquis 0.00274 16.60,29.00,33.00 
Mairzy Doates-Poppycock-Croquis 0.00053 28.60,29.30,33.20 
1.00000 


Table 3: The probability of any three horses being the in-the-money finishers is given, along 
with the resulting show payoffs. These probabilities were computed using the Harville [1973] 
formulas. The probability of an i, j, k finish, where q,is the probability of winning, is 9,4,9,/ 
[1 —4) (1—q4,—q,)]. Stern [1987] discusses alternative probability models (see also the 
discussion in Hausch and Ziemba [forthcoming al). 


any sort. The previous harness racing re- 
cord was $296.00 in 1986 at Dover 
Downs. This extreme example would 
have allowed a lock returning close to 
four percent. (Many thanks to Pete Asch 
for pointing out this race to us). Other ex- 
amples are mentioned in Ziemba and 
Hausch [1987]. 

Since the lock strategy requires a guar- 
anteed profit, it is much more conserva- 
tive than the optimal capital growth 
strategy described in Ziemba and Hausch 
[1987]. The optimal capital growth strat- 
egy asymptotically maximizes the rate of 
growth of one’s bankroll. This is achieved 
by maximizing, in a myopic race-by-race 
fashion, the expected value of log utility. 
A comparison of these two strategies can 
be made with the Alabama Stakes race. 
Table 3 shows possible payoffs on each 
horse and their likelihoods, and Table 4 
gives the expected return to a show bet 
on each horse. 

Only the show bets on Davona Dale 


and It’s in the Air have positive expected 
profits. With an initial wealth of $2,500, 
the optimal capital growth bets are 


Davona Dale $2,294 
It's in the Air 203 
Mairzy Doates 0 
Poppycock 0 
Croquis 3. 


The entire $2,500 is wagered, and regard- 
less of the order of finish our bettor does 
not go bankrupt. The certainty equiva- 
lent, that is, the certain return giving the 
same utility as the expected utility of the 
gamble using the logarithmic utility func- 
tion, is $101. The wager on Croquis has 
an expected return of only $0.607 on the 
dollar. While it is unlikely that both 


Horse Expected Return 
on a Show Bet 

Davona Dale 1.039 

It's in the Air 1.090 

Mairzy Doates 0.658 

Poppycock 0.524 

Croquis 0.607 


Table 4: The expected return to show is 
calculated for each horse. 
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Top three horses Profit 
Davona Dale-It’s in the Air-Mairzy Doates $121.85 
Davona Dale-It’s in the Air-Poppycock 121.85 
Davona Dale-It’s in the Air-Croquis 125.00 
Davona Dale-Mairzy Doates-Poppycock — 91.30 
Davona Dale-Mairzy Doates-Croquis — 88.15 
Davona Dale-Poppycock-Croquis — 88.15 
It’s in the Air-Mairzy Doates-Poppycock 808.20 
It’s in the Air-Mairzy Doates-Croquis 908.20 
It's in the Air-Poppycock-Croquis 908.20 
Mairzy Doates-Poppycock-Croquis — 2,399.80 


Table 5: Using the optimal capital growth wagers, profits are given for the possible trios of 
horses in the money. The profit is not affected by the order of the three, though. 


Davona Dale and It’s in the Air will finish 
out of the money, Table 3 shows that, in 
that case, Croquis has a better payoff 
than Mairzy Doates and Poppycock. Table 
5 presents the possible profits from these 
wagers (accounting for their effect on the 
odds). 

Tables 3 and 5 show that the chance of 
a loss is 10.7 percent and the expected 
profit can be calculated to be $106.28, or 
4.25 percent on the bankroll. Clearly, this 
expected return is considerably higher 
than the lock’s certain 2.1 percent return. 
The actual finish, Poppycock-Davona 
Dale-It’s in the Air, yielded a profit of 
$121.85. 

Such a lock exists maybe five to ten 
times a year in North America. While you 
are enjoying the performance of a super 
horse, you may make some profit as well. 
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Abstract 


We present arbitrage and risk arbitrage betting strategies for team jai alai. Most of the 
results generalize to other sports betting situations and some financial market applica- 
tions. The arbitrage conditions are utility free. The risk arbitrage wagers use the Kelly 
expected log criterion. 


Keywords: arbitrage, risk arbitrage, hedging, sequential investing 


1. INTRODUCTION 


This chapter discusses arbitrage and risk arbitrage strategies for betting on team Jai 
alai.! The game originated in the Basque region of Spain and is played in Mexico City, 
Connecticut, Florida, Nevada, Rhode Island, and other locales (see Hollander and 
Schultz, 1978). It is played in a large enclosed rectangular court called a fronton 
between two opposing teams, each having two players. Players serve each point in turn 
and single points are scored by one team winning a rally off the serve, as in squash, 
racquetball, or tennis. Opposing teams alternate hurling and catching a ball (the pelota 
made of goatskin and hard rubber which must be recovered every 15 min of play) with 
an enlarged basket (cesta) against a wall (granite or concrete). When one team misses, 
the other team scores a point. The game is fast and exciting. Games are usually played 
to 30 points. At the fronton, bets may be placed on either team to win the game before 
every point is played at fixed locked-in odds, until the outcome of the game. Payoffs on 
bets made during the game are settled at the end of the game based on the quoted house 
odds at each betting point. We construct arbitrage and risk arbitrage bets with zero or 
little risk while at the same time yielding a positive return. 

Arbitrage occurs in strategies when the net gain of all bets is always non-negative 
and sometimes positive and involves no risk of losing. Conditions that lead to arbitrage 
in various circumstances are studied in Kallio and Ziemba (2007). Risk arbitrages may 
yield losses, but occur more frequently and have higher mean returns. We develop these 
arbitrages for team jai alai. Section 2 provides conditions for arbitrage. Risk arbitrage 
is discussed in Section 3. Final remarks and applications to other areas are discussed in 
Section 4. 

Assume that 


1. The jai alai fronton bet payout rate is the constant Q € (1,0). 

2. The two teams’ relative ability is known and defined by the probability of win- 
ning a single point—team A wins with the score invariant probability p, and B 
with q = 1 — p. 


The probability of A reaching K points before B, given that A currently has 
O<m<K points and B has 0<n< K points, is (according to Montmort, see 


! Goodfriend and Friedman (1975, 1977) and Skiena (1988) have analyzed the game of individual jai alai. 
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Epstein, 1977, p. 109): 


P, = P,(m, n) 
= px | +k- m+ p EE mt) me SAED 
Ken-t (2K-m-n-2)! 
(K-m-1IXK-n-1)! 


K-n-] 
K-m+i-1 
— »,K-m i 


For K = 30, P, is the probability that team A will win the game given the current score 
is m to n. A schedule of P, and P, for all values of m and z over the 30 point game in 
abbreviated form appears as Table 1 for the case of p = 0.5. 

For given fixed Q, a schedule, as shown in Table 2, can be computed from the 
consistent house odds over the game since 


tee tg 


P, = Q/(O, + 1) and P, = Q/(O; + 1) or 


Q Q Q-P, 
——— oe es : 1 
(0) P, 1 and O, P, P, (1) 


In Equation (1) O, and O; are the consistent house odds for teams A and B respectively, 
when the score is (m, n). Odds of 1.5 to 1 means 1.5 profit plus the original bet or 2.5, 
is returned for each 1 bet, and so on. Consistent odds are those that return 1 — Q for 
the house’s profit regardless of which team, A or B, wins. Then the expected return per 
dollar bet on each team is Q. Since P, + P, = 1 independent of Q, the odds then reflect 
the actual value of Q through Equation (1). Hence 


_ Q- PaXQ — Pr) =,;-2@0-9) 


O ph A 
eO; P,P, P,P, 


(2) 


For given Q and odds on A of O, to 1, consistent odds on B may fail to exist that 
guarantee the house advantage 1 — Q. In that case, if there is a minimum payout 
of 1, that is, you just get your money back, then there is a minus pool and the house’s 
actual take is less than 1 — Q. A minus pool is defined to be this situation where the 
house take is 1 — Q* < 1 — Q, where Q* > Q. So the effective Q, namely Q*, is higher 
and the odds on B always exist since at their lowest the odds on O, are the reciprocal 
of the odds on A. That is the case where the payback is all the money wagered and 
the house makes no profit since Q* = 1. In general, these odds are possibly as low as 
the minimum guarantee, that is, O, = 1/O,. In a typical minus pool, the odds on B 
are higher than 1/O,. For example, if the minimum payout is 1.05, as is typical, then 
consistent odds on O, may fail to exist. In this case, the odds on A are too large given 
the 1 — Q* demanded by the minimum guarantee so that the odds on B do not exist. 
Let O, be Q = 1 consistent odds (i.e., odds that give Q = 1) and Oy, are related by 


TABLE 1 Probability that Team A Wins When the Score Is A = m and B = n and the Single Point Probability is p = .5 

a/n o 6% 2 3 4 5 6 7 8 9 30 44 12 $3 14 9S 16 97 98 19 20 21 22 23 24 25 26 7 20 29 

° +80 .55 .60 .65 .70 .75 .79 .83 .B7 .90 .92 .94 .9% .97 698 199 199 

1 245 2.50 .SS .61 .66 .71 .76 .80 .84 .67 .90 .93 .95 .96 .96 .98 .99 .99 

2 +40 .45 .50 .55 .61 .66 .71 176 .80 .86 .88 .91 .93 .95 .97 .98 .99 .99 

3 +34 139 .45 .50 «56 161 666 .72 276 .8t .85 .88 .91 194 .96 197 .98 .99 99 

4 -JO «34 139 144 050 156 .6t 067 272 1.77 .81 185 189 .92 .94 .96 197 1.98 .99 .99 

5 «25 629 .J4 .39 .&4 .50 .56 .6} .67 .72 .77 .82 .86 .89 .92 .95 .96 .98 .99 .99 

6 -21 124.29 134 039 044 150 .56 .62 .67 .73 .78 .83 .87 .90 .93 .95 .97 .98 199 .99 

1 -17 .20 .24 .28 .33 .39 .44 .5O .56 .62 .68 .?3 .79 .83 .87 .93 .93 .96 .97 .90 .99 .99 

8 243.16 20 124 .28 .33 238 144 150 .56 .62 .68 .74 .79 .84 .88 191 194 196 097 .99 199 

9 230 213 216 239 423 128 239 038 .44 150 .S6 .63 .69 274 60 .64 .89 192 .94 .96 .98 199 99 

10 -0G 610 212 .15 219 123 227 632 238 144 250 256 163 .69 .75 1.60 .85 .89 193 19S .97 98 299 

n -06 07 .09 .12 .15 .18 .22 .27 .32 .37 .44 .50 .$7 .63 .70 .76 .Bi .B6 .90 .93 .96 .97 .99 .99 

2 204 0S 607 09 210 214 217 421 26 031 .37 143 .SO .57 .64 .70 .76 .B2 287 091 .94 .9%6 196 .99 .99 

3 203 D4 .05 .06 06 .1) .13 .17 .21 .26 .3t .37 .4) .50 .57 .64 271 .77 .83 .88 .92 .95 .97 98 .99 

14 202 .02 203 .04 .06 .08 410 .13 .16 .20 «25 .30 36 -43 050 .S7 264 271 278 64 89 192 .95 197 199 199 

15 -Ot .02 .02 .03 .04 .05 .07? .09 .12 .16 .20 .24 .30 .36 .43 .SO .57 .65 .72 .?9 .85 .89 .93 .%6 .98 .99 

16 +01 Ot 01 02 .03 O4 605 .07 .09 .11 615 .19 24 .29 .36 .43 050 S56 16S .73 .BO .86 .9) .94 ,9? 98 199 

7 .0) .01 .01 .02 .02 .03 .04 .06 .06 .31 .14 218 .23 129 035 142 50 .SH .66 .74 .81 .87 .92 .95 .98 199 

8 +O) 01 Ot 02 .03 .04 .06 .07 .10 .t3 .?? ,22 .28 .35 .42 .5Q .S8 .67 .75 .82 .88 .93 .96 .98 .99 

19 201 .01 01 .02 .03 .04 .05 .07 .09 .12 96 .21 427 .J4 .42 .SO .59 .68 .76 .83 .89 .94 197 .99 

20 »O3 .OF Oi .02 .03 .04 .06 .08 «th 215 220 226 233 243 .$50 .59 69 1.77 85 .91 19S 198 199 

ai -01 .01 .01 .02 .03 .04 .05 .08 39 114 199 625 132 241 050 1.60 .70 .79 1.87 293 .97 99 

22 -O1 Ot .01 .02 .03 605 07 .O9 213 .18 124 31 .40 150 .60 .71 161 689 .95 .98 

23 -01 .0t .02 .03 .04 .06 .06 «12 .17 1.23 130 .40 .50 61 .73 163 191 296 99 
24 +01 .01 O1 02 03 OS .07 .1f .15 228 .29 .)9 250 .62 .75 .06 .9%4 .98 
25 ‘Ot OI .02 .02 .04 .06 09 .1) .19 .2? .38 .SO .64 .7?? 89 .97 
z6 +01 .O1 .02 .0} .0S .07 «51 17 1.25 .36 .50 .66 .Bt .94 
2 =O) .01 202 .03 .05 .09 1.14 123 .34 250 169 .88 
28 201 O3 602 1.04 .06 613 619 631 150 17S 
29 -Ot .02 .03 .06 .13 .25 .50 


NOTE: This table is symmetric in the sense that Prob(A wins) with score m, n equals Prob(B wins) with score n, m. This occurs if and only if p = .5. 


Source: Lane and Ziemba (2004). 
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TABLE 2 Consistent Odds for Teams A and B for Given Q 


Payback rate, Q 


257 


House odds, O,/1 1.00 0.950 0.900 0.850 0.800 0.750 0.700 
0.100 10.000 9.025 8.100 7.225 6.400 5.625 4.900 
0.111 9.000 8.123 7.290 6.503 5.760 5.063 4.410 
0.125 8.000 7.220 6.480 5.780 5.120 4.500 3.920 
0.143 7.000 6.318 5.670 5.057 4.430 3.938 3.430 
0.167 6.000 5.415 4.860 4.335 3.840 3.375 2.940 
0.200 5.000 4.512 4.050 3.613 3.200 2.813 2.450 
0.250 4.000 3.610 3.240 2.890 2.560 2.250 1.960 
0.333 3.000 2.707 2.430 2.167 1.920 1.688 1.470 
0.500 2.000 1.805 1.620 1.445 1.280 1.125 0.980 
1.000 1.000 0.903 0.810 0.723 0.640 0.563 0.490 
1.500 0.667 0.602 0.540 0.482 0.427 0.375 0.327 
2.000 0.500 0.451 0.405 0.361 0.320 0.281 0.245 
2.500 0.400 0.361 0.324 0.289 0.256 0.225 0.196 
3.000 0.333 0.301 0.270 0.241 0.213 0.188 0.163 
3.500 0.286 0.258 0.231 0.206 0.183 0.161 0.140 
4.000 0.250 0.226 0.202 0.181 0.160 0.141 0.123 
4.500 0.222 0.201 0.180 0.161 0.142 0.125 0.109 
5.000 0.200 0.180 0.162 0.144 0.128 0.113 0.098 
5.500 0.182 0.164 0.147 0.131 0.116 0.102 0.089 
6.000 0.167 0.150 0.135 0.120 0.107 0.094 0.082 
6.500 0.154 0.139 0.125 0.111 0.098 0.087 0.075 
7.000 0.143 0.129 0.116 0.103 0.091 0.080 0.070 
7.500 0.133 0.120 0.108 0.096 0.085 0.075 0.065 
8.000 0.125 0.113 0.101 0.090 0.080 0.070 0.061 
8.500 0.118 0.106 0.095 0.085 0.075 0.066 0.058 
9.000 0.111 0.100 0.090 0.080 0.071 0.063 0.054 
9.500 0.105 0.095 0.085 0.076 0.067 0.059 0.052 

10.000 0.100 0.090 0.081 0.072 0.064 0.056 0.049 


Source: Lane and Ziemba (2004). 


Os, = Q*O, — (1 — Q*). Actual house odds, Oa, may differ from the consistent odds 
O, and O, for a number of reasons, including the desire of the oddsmakers to balance 
their books, competition among individual bookies for larger shares of the total pool, or 


additional information about teams’ performance. This then adjusts the actual 1 — Q* 
that the house receives. 
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For team A, or respectively B, the house odds are: Oaa = Oa (consistent), Og, > Ou 
(favorable), and Oaa < Oa (unfavorable). 


2. THE ARBITRAGE 


Arbitrage is the proverbial sure bet. For each betting point of a K point game with a 
utility function U and betting wealth W, optimal arbitrage bets can be found by solving 


max E [U (Ba, Bs)] (3) 
B,>0,B,>0 


s.t. B+B, SW 
BaOan + Ba > Ba + Bp 
B, Oy, + B, > Ba + By 


where B, and B, are the amounts bet on A and B, respectively, and E is the expectation 
operator. Besides the budget constraint, the arbitrage constraints indicate that the return 
if either A or B wins is never less than the total bet on A and B. This reduces to the 
arbitrage betting condition at every point of the game. These constraints yield 


1/Oan < Ba/Br < On, By #0, (4) 
which demonstrates: 
Theorem 1. The arbitrage exists if 
OchOvn 2 1. (5) 


The arbitrage condition in Equation (5) is utility free and holds for all U. Both 
B, and B, must be positive or both zero. 


The constraints in Equation (3) imply that 
B,/ By, < W/B, - 1. 
By Equation (4), 
1/Oan, < W/B, —- 1. 
Hence 
By < OanW/(1 + Oan). 
Similarly, 


Ba 2 w/a + Oan). 
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We consider betting strategies using the strategy variable f > 1, using 
Ba = fW/(1 + Oan) and By = W(1 — f + Oar)/C + Oan), 
when 
OchOnm = 1, Ba = W/(1 + Oan), Bo = OchW/(1 + Oan) and f = 1. 


To guarantee the arbitrage, the profit must be non-negative regardless of which team 


wins: 
profit if A wins = W (Oan f — (1 — f) — Oan)/(1 + Oan) (6) 
=W(f-1)20, 
profit if B wins = W (O (1 — f) + Ox,0an — P) + Oar) (7) 


= W (Onl + Oan) — FC + Oon))/CL + Och) > 0. 
To satisfy Equations (5—7) 
Smin = 1S f < Opn(1 + Ogh)/C1 + Obr) = fmax: (8) 
Figure 1 illustrates how the net payoffs vary with f. For 
S” = (Oan + 1)(Oon + 1)/(1 + Och + Ooh +2), (9) 
one maximizes the minimum arbitrage profit 
W (OarOen — 1/Och + Orn + 2). 


More insight about the arbitrage betting condition in Equation (5) may be obtained by 
comparing it to consistent odds at each betting point. They require that 


0,05 = Q’. (10) 
The house odds favorability factors rą > 0 and r, > 0 for favorable odds are defined by 
Oan = Oall +2) and Op, = On(1 + re). (11) 

Then Equation (5) implies that 
QA Hra +r) > 1. (12) 


Note that r4 > O and rg > 0, although typical, is not required for Equation (12) see also 
Figure 2. 
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Profit 


WERT (Ay EA ae Se RR SSH Sema SRS a Ais 


max H (B) 


Maximin | Sea SSS 


hedge 
profit 


f, Strategy variable 


fnin = 1 fmax 


FIGURE 1 Profits for Teams A and B for initial betting wealth of | unit vs. strategy variable f, when 
Oan > Opn and OghOpn > !. Source: Lane and Ziemba (2004). 


Q1 > Q2> Q3 


0 rb 


FIGURE 2 House odds favorability regions. Source: Lane and Ziemba (2004). 


If ra = rẹ = r, the schedule of Q versus r and the region of betting under the perfect 
hedge is as shown in Figure 3. 

For typical values for Q of about 0.85 the required favorability for house odds quoted 
on both teams is nearly 20%. Such discrepancies occur occasionally in actual betting. 
More frequently, however, one needs to take added risk to get good bets so we now turn 
to the construction of risk arbitrages. 
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r 


House odds 
favorability, 
lg=lp=r 


Q, Payback rate 
0 0.5 1.0 


FIGURE 3 Betting region of Q vs. r. Source: Lane and Ziemba (2004). 


3. RISK ARBITRAGES 


Two approaches are considered for constructing risk arbitrage positions. They exploit 
the observed house odds favorability conditions to find good arbitrages strategies. 
A maximal capital growth model encompassing these approaches is 


unos (Un W(a)] (13) 


st È (Bali) + Beli) < Wo 
i€g 
È, BaliOzali) > a $, Boli) 
ieg ieg 
È BoliOn(i) 2 a $, Bali), Yg EG 


ieg ieg 


where E, represents mathematical expectation with respect to the game path g, G is the 
set of scenario game paths from (0, 0) to the final outcome, W (g) is the wealth with 
g, and the constant a > 0 is the relative degree of risk of the bettor in the arbitrage. 
For example, if « = 3/4, the bettor requires that total returns must cover at least 75% 
of total bets in any game. With a = 1, the goal is to find an arbitrage to cover all bets 
and a premium is required for betting in any game if a > 1. The log utility function 
corresponds to the Kelly (1956) system of betting which maximizes the asymptotic 
long run rate of growth of the bettor’s fortune. See MacLean and Ziemba (2006), Thorp 
(2006), and Ziemba and Ziemba (2007), for a summary of results concerning such 
betting strategies. 

The first approach to the risk arbitrage problem is a model that analyzes the objec- 
tive function over all feasible paths of the game and computes the bets B,, B, which 
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TABLE 3 Cardinality of G is Large 


Game 
Points in Total betting points Number of 
the game betting points Min Max game paths 


1 1 1 1 2 
2 4 2 3 6 
3 9 3 5 20 
4 16 4 7 70 
5 25 5 9 252 
30 900 30 59 5.91 x 10!6 


maximize Equation (13). This presupposes that information is known in advance 
about the odds Oan and Obr, actually set by the house throughout the game. This 
information may be a probability distribution or a function for Oan and Og,, over the 
scores of the game. The drawback is that the cardinality of G is very large as shown in 
Table 3. Calculations for two and three point games using a = 1, Q = 0.85, and p = 0.5 
appear in Table 4. These calculations utilize the following probability distributions for 
the house odds favorability factors rg = r, = r: 


1. r ~ N(0,o7) is iid for all scores of the game. 
2. r~ N[r(i),o7], depends upon the size of the lead and the number of points the 
leader is away from winning the game. 


This relation is given by 


ri) = exp(—M/D)-— 1 if teami is leading by M points 
exp(pM/D)-—1 __ if team i is trailing by M points, 


D is the number of points the leader is away from winning the game, namely K — S, 
where K are the points needed to win the game and S is the score of the leader. 

The results yield insights that may be useful in the construction of good heuristic 
strategies for 30 point games. For the normally distributed favorability factors, there is 
an intrinsic threshold value for the odds favorability below which no initial bet is placed. 
This threshold is about a 20% odds favorability. Secondly, betting almost always takes 
place in natural-arbitrage pairs, where a bet on one team at one point of a particular 
game path is paired with a compensating bet on the other team later in the game along 
the same game path. Where perfect arbitrages could be constructed, these dominated 
all other possible bet points. As variance increases, the number of lucrative bets also 
increases both in number and in size of bet. 
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TABLE 4 Sample Results for Two and Three Point Games 


House Games bet per Average percent of bet Average value of 
odds type W œ games played compared to game bet funds per game played 
Two point games 
(1) 1 0 O/1 0 0 
1 l 1/10 100 0.0005 
l= 32 5/10 84 0.0073 
(2) 1 0 0/1 0 0 
1 cl 0/10 0 0 
I-32 2/10 100 0.0119 
Three point games 
(1) 1 0 oft 0 0 
Lo al 0/10 0 0 
1 2 7/10 80 0.0193 
(2) t 0 0/1 0 0 
Me ad 3/10 75 0.0098 
‘ey 4 5/10 60 0.0059 


Source: Lane and Ziemba (2004). 


Many betting points occurred in pairs of tied scores and trailing team bets, where 
the higher odds for the trailing team boost the combined bet pair over the arbitrage 
condition requirements. Under these situations, gains could be realized on a team that 
came from behind to win, and strategies can concentrate on this possible event occurring 
while hedging a priori that the leading team wins. 

In the exponential function distribution, the high relative favorability of the trailing 
team contributes to still greater emphasis on betting on the trailer to come from behind 
to win while hedging (usually early in the game, or at a tied point) on the leader to win. 
As in the normal case, the initial bet favorability threshold value of 20% continues to 
manifest itself in the results. 

Additional calculations showed modest expected total gains in the range of 1-2% 
using this method for sets of 25 games. The gains in the three point game are larger 
than the two point game for the same level of variance, which suggests that higher gains 
could be anticipated as the game size increases and more betting opportunities arise. 

We now utilize these insights in a second approach to risk arbitrage by constructing 
single arbitrage bets for the 30 point game. A single arbitrage is a bet on one team and 
subsequently a bet on the opposing team later on in the game such that the constraints 
of Equation (13) hold. Unlike arbitrage, which required betting both teams at the same 
time, this risk arbitrage does not require concurrent bets. The idea is to exploit the 
favorability of quoted odds on one team at some point in the game and take the risk 
that the house odds will become attractive enough on the opposing team later on so 
that an arbitrage may be constructed. However, the second half of this bet will not 
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always materialize. Volatility and prediction models are useful here. This hedge may be 
formulated by examining the constraints on the bets B and 


Ba(Sa)Oan(Sa) 2 aBs(Sa), a > 0 


(14) 
ByCS5)Oon(S4) > Ba Sp), 
where S, and S, are the scores of the game at the time a bet is made on team A and B, 
respectively. 

The approach taken in the construction of the single hedge is to simulate the passage 
of the 30 point game using the same assumptions about odds favorability as with the 
two and three point game model. 

The simulation model of the 30 point game generates at each point an odds favora- 
bility for each team, and the winner of the point using a uniform distribution on p. Bets 
were generated by first initializing a threshold favorability value which is used as the 
basis for placing the initial bet. Once the initial bet is placed whose amount is deter- 
mined by criteria discussed below, the game continues until a new betting point for the 
opposing team is found such that the arbitrage condition in Equation (5) is satisfied. At 
the end of the game the bets are settled and the results recorded. Each simulation run is 
a set of 30 point games. 

To find better strategies, and study the sensitivity of the model, different sets of games 
were simulated and compared in order to find the threshold favorability values which 
gave consistent positive expected net gains over the entire set of games. The results 
are highly sensitive to the value of this parameter. Low threshold values typically mean 
early betting points and more likely completion of the single hedge. However, early bets 
typically mean low odds and thus a small hedge margin (the amount by which OarObn 
actually exceeds 1) and consequently smaller net gains. Higher values mean delayed 
betting points with a greater chance of not completing the second half of the hedge. 
However, potential loss due to unhedged or single bet games is compensated for by 
higher hedge margins (due to later game scores, larger point spreads and higher odds) 
and hence more potential net gains when the risk arbitrage is successfully completed. 
Typical results for these single arbitrage betting pairs appear in Table 5. In two games, 
the risk arbitrage was completed. One leads to a positive gain, the other to a break even 
situation. In the other game, the risk arbitrage was not completed and leads to a loss. 
If a risk arbitrage is not completed, it invariably leads to a loss because the team that you 
would like to bet on to complete the quasi-hedge remains ahead throughout the game. 

Figure 4 gives a profile of three different sets of games under various initial bet 
threshold favorability values for the normally distributed case. 

In the cases illustrated and described above, the implicit assumption in the con- 
struction of the hedge and the determination of the amounts bet was that the arbitrage 
condition in Equation (5) is satisfied as an equality, that is, OarOsa = 1. Thus the arbi- 
trage margin was assumed to be zero in the construction of the arbitrage bet pair. 
However, if a margin, m > 0 exists, such that 


OgnOp, = 1 +m 
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TABLES Sample Results for Single Risk Arbitrage Betting Pairs 


Game Amount Score Actual House 
number Bet Team bet A B odds odds Favorability 
10 1. B 0.8581 18 23 0.1288 0.1654 0.2838 
2 A 0.1419 6.2963 7.6719 0.2185 
Final 
score Bets Payoff 


Team A 31 0.1419 1.0886 
Team B 29 0.8581 —0.8531 
Net betting payoff = 0.2304 


Game Amount Score Actual House 
number Bet Team bet A B odds odds Favorability 
11* l. A 0.0306 14 23 25.7480 31.6751 0.2302 
Final 
score Bets Payoff 
Team A 15 0.0306 —0.0306 
Team B 31 0.0 0.0 


Net betting payoff = —0.0306 


Game Amount Score Actual House 
number Bet Team bet A B odds odds Favorability 
21 k B 0.6493 0 3 0.4466 0.5401 0.2094 
2. A 0.3507 l 5 2.0653 1.9782 —0.0422 
Final 
score Bets Payoff 


Team A 17 0.3507  -0.3507 
Team B 30 0.6493 0.3507 
Net betting payoff = 0.0000 


* Arbitrage condition not realized, only one bet placed. Source: Lane and Ziemba (2004). 


then the effect is to permit bets to place more emphasis on one team or the other or 
neither within the limits of satisfying the constraint set of Equation (13) (see Figure 1 
for the arbitrage case). The perceived existence of a risk arbitrage margin allows the 
bettor to express the bet as a function of the score at the time of the bet placement, or as 
merely a prediction about the eventual outcome of the game. The actual margin cannot 
be known until the arbitrage condition is satisfied at the second betting point. However, 
because the value of the margin affects the initial bet, it must be anticipated by some 
estimate. 

If a margin exists, then the arbitrage, if it has been successfully constructed, satisfies 
all the conditions required for a member of the infinite set of betting pairs defined by 
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Total f 
net a 
gain coe 


0 Threshold 


20 


s po favorability 
Positive total 
net gain region 
common to all 
games sets 
Set C 
FIGURE 4 Thirty point game, single risk arbitrage simulation ro search. Source: Lane and Ziemba 
(2004). 
TABLE 6 Total Gain Values for a 100 Game Set 
fi min f* fi max 
m/ro 0 0.05 0.10 0.20 0 0.05 0.10 0.20 0 0.05 0.10 0.20 
0 1.41 0.47 -0.22 2.22 I4 0.47 —0.22 2.22 14l 0.47 -0.22 2.22 


0.05 2.22 1.09 -0.02 3.35 2.01 0.93 -0.05 3.50 1.92 0.86 -0.10 3.89 
0.10 2.67 2.43 -0.02 3.18 2.32 2.25 -0.08 3.62 2.07 1.92 —0.15 4.27 
0.20 1.19 0.08 -147 3.09 0.78 —0.03 -1.50 409 0.17 -0.68 -1.57 5.17 


NOTE: Exponential function favorability, o = 0.2, constant strategy. Source: Lane and Ziemba (2004). 


the single betting strategy variable, f of Figure 1. However, if the margin should not 
materialize, then the risk is that the risk arbitrage will not be completed and the game 
may end with an unpaired bet that may be lost. If on the other hand the anticipated mar- 
gin understates the actual margin, then there is an opportunity loss due to the wrongly 
specified betting split. Hence even though the bet is not lost, we could have done better 
had we known the exact margin value. 

This model was used in this second way to search for best values for the anticipated 
margin factor, m and the threshold favorability factor, ro values for the different sets of 
games described above. In the first instance, a constant betting strategy was employed 
independent of the score at the time of the initial bet. The strategy variable used here 
corresponds to the upper and lower limits of f, as well as f*. 

Typical results for various m values are in Table 6. 
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When the anticipated margin is zero, the results of the three strategies are identical 
since the zero margin assumption uniquely defines the bets for each team (given by 
f=. 

For some (m, ro) pairs, the fmin Strategy yields better results while for other pairs fmax 
is superior. The f* results always take intermediate positions between fmin and fmax for 
all (m,r). The results for all three strategies however are not significantly different for 
any (m, ro). This performance is not unexpected and for larger game sets it is expected 
that there would be less difference between the results of these three strategies. 

As m increases for constant ro, the total gain rises and then falls off as the second half 
of the bet pair is more difficult to complete. Low ro returns are, in general, positive with 
low variance since bets are placed early and the pairs are completed early, generally with 
low odds. High ro values yield better mean returns with larger variances. Intermediate 
ro values reduce the returns. The results of this strategy for the same set of games used 
in Table 6 appear in Table 7. 

Comparing the automatic versus constant variable strategies shows that the automatic 
strategy dampens the extreme results of the fmin and fmax strategies of Table 6 while 
improving on the more conservative results of f*. While these results may not be consid- 
ered as being significantly different, the trend of the automatic strategy is toward a more 
stable and profitable outcome. More importantly, the variance of the expected gains is 
reduced by about a third from six to four over the comparable results in Table 6. Finally, 
the automatic strategy, being score dependent, is more intuitive and appealing and is the 
preferred policy for the single hedge construction problem in the 30 point game. 

While single risk arbitrage jai alai results are encouraging, the gains that occur for 
the simulated game sets are modest. There is greater potential for larger gains in longer 
games as there is more opportunity for inefficiency to manifest itself through volatility. 

Combining the results of the two risk arbitrages, our final analysis examines the 
policy of betting over the 30 point game through the construction of a series of arbitrage 
bet pairs. 

The simulation model discussed above was modified to accommodate a series of 
single arbitrage bets. The same amount was assumed available for each arbitrage bet 
pair with a maximum total bet availability requirement of 60 betting units. 


TABLE 7 Single Hedge Construction; Total Gain Over 100 
Games Sets Using an Automatic Strategy Variable 


Threshold favorability. ro 


m 0 0.05 0.10 0.20 
0 1.41 0.47 —0.22 2.22 
0.05 2.11 1.03 —0.02 3.55 
0.10 2.49 2.33 0.02 3.61 
0.20 0.94 0.02 —1.24 3.12 


Source: Lane and Ziemba (2004). 


140 Calendar Anomalies and Arbitrage 


268 Chapter 13 + Arbitrage in Team Jai Alai 


TABLE 8 Betting Summary: Multiple Quasi-Hedge Bets 


Amount Score Actual House 
Bet Team bet A B odds odds Favorability 
1 A 0.5459 ! 0 0.6894 0.8733 0.2668 
1 B 0.4541 l 0 1.048 1.2047 0.1495 
2 A 0.523 1 1 0.85 0.9356 0.1007 
2 B 0.477 4 2 1.3188 1.4157 0.0735 
3 A 0.535 2 1 0.6869 0.9127 0.3288 
3 B 0.465 2 t 1.0519 1.16 0.1028 
4 A 0.6342 4 2 0.5479 0.6057 0.1055 
4 B 0.3658 9 6 1759 1.7204 ~0.0219 
5 B 0.3139 10 7 1.7899 2.1853 0.2209 
5 A 0.6861 10 9 0.6606 0.7306 0.1059 
6 B 0.3855 11 9 1.4189 1.5938 0.1233 
6 A 0.6145 12 11 0.6517 0.6392 —0.0192 
7 B 0.3167 12 10 1.4385 2.1578 0.5 
7 A 0.6833 12 10 0.5022 0.5551 0.1052 
8 A 0.4782 12 12 0.85 1.1168 0.3139 
8 B 0.5218 25 24 1.4049 1.0277 —0.2685 
9 A 0.2919 19 21 1.7751 2.4262 0.3668 
9 B 0.7081 19 21 0.407 0.4617 0.1344 
10 A 0.2301 19 22 2.6866 3.3467 0.2457 
10 B 0.7699 21 23 0.3706 0.338 —0.088 
11 A 0.4152 22 23 1.3005 1.4085 0.0831 
11 B 0.5848 23 23 0.85 0.7379 —0.1319 
12 A 0.4123 23 24 1.3452 1.4254 0.0596 
12 B 0.5877 25 25 0.85 0.8935 0.0512 
13 B 0.4613 24 24 0.85 1.1942 0.4049 
13 A 0.5387 24 24 0.85 0.9764 0.1487 
14 A 0.4969 25 25 0.85 1.0372 0.2203 
14 B 0.5031 26 25 1.4898 1.5992 0.0734 
15 B 0.5303 26 26 0.85 0.9089 0.0693 
15 A 0.4697 28 29 255 3.3276 0.3049 
16 B 0.508 29 29 0.85 0.9926 0.1678 
Final 
score Bets Payoff 
Team A 29 7.5549  —7.5549 
TeamB 30 7.9532 8.7968 


NOTE: Net betting payoff = 1.2419. Source: Lane and Ziemba (2004). 
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TABLE 9 Exponential Function Favorability, 
g = 0.3; 100 Game Set Hedge Series Construction 


Threshold favorability, ro 


m 0 0.1 0.2 

0 -1.96 2.12 11.29 17.21 
0.05 —0.76 4.51 13.27 19.12 
0.1 0.79 3.77 10.34 17.26 
0.2 2.83 4.74 8.19 14.17 


Source: Lane and Ziemba (2004). 


The algorithm proceeds by requiring that the preset threshold favorability factor 
value is satisfied before the initial bet of any pair is made. Priority over initializing the 
first half of a new risk arbitrage is given to matching unmatched hedges. The automatic 
strategy policy is used for the construction of all pairs. 

Table 8 describes the bets in a typical game where 24% of a betting unit is the 
total profit. A summary of the results of this simulation appears in Table 9. The results 
are encouraging. Depending on the anticipated margin and the threshold value chosen, 
the number of hedges completed in the simulated games can range from 1 to 20 with 
never more than three uncompleted bet pairs among this series. Losses may be incurred 
on any particular game under this imperfect hedge strategy due to the possibility of 
uncompleted bet pairs. Such losses never exceeded one betting unit for any of the sim- 
ulated games, whereas the single game gain ranged as high as three units. Game sets 
are divided approximately 60/40 in terms of winning to losing games for the sets exam- 
ined. The total gain over the entire set of games is increased relative to the single hedge 
strategy and the instance of loss is reduced. As the odds favorability variance increases, 
the potential for more profitable bets occurs and the expected gains and variance rise 
accordingly, positive net gains occur regularly for similar values of the standard devia- 
tion of the odds favorability distribution. The variance of the expected gains are larger 
than for the single hedge case (40 versus 6) as expected. 


4. FINAL REMARKS 


It is possible to construct profitable arbitrage strategies for the 30 point jai alai game. 
Modest returns may be realized under strategies of arbitrage and risk arbitrage sin- 
gle bet pair constructions. Mathematical programming results imply that series of bet 
pairs may be optimal for games of this kind. Simulation results suggest that improved 
gains may be obtained under such strategies where bet placements are dependent on the 
favorability of quoted odds and the score. Further analysis of this situation might con- 
centrate on more detailed investigation into the actual distribution of quoted house odds 
during the game. This will involve more intensive data collection at jai alai frontons. 
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Data collection in Mexico City indicates that there are substantial inefficiencies. The 
formulation here has assumed score invariant single point win probabilities. A more 
refined but possibly unmanageable analysis might consider score variant probabilities 
possibly using Markov chains. More study should also be undertaken with respect to 
the reaction of the oddsmakers to shifts in score and an examination of their individ- 
ual objective functions. Parayre (1986) has some results along these lines for win and 
perfecta bets based on player strength and post positions. These results follow ideas in 
Ziemba and Hausch (1986). 

The methodology and insights found for team jai alai also have potential applications 
in other situations where one has non-marketable financial instruments. These include 
certain horse racing (especially on betting exchanges), currency exposure, and produc- 
tion situations. Risk arbitrage in traded options and warrants markets is an additional 
example. See Shaw et al. (1995) for one such application related to the Japanese Nikkei 
put warrant in 1989-1990. 

In England and other European and Commonwealth countries, legalized bookies set 
odds that various horses will win a given race both on-course and off-course. These odds 
may differ across bookies at a particular moment in time. The odds change during the 
20 or so minutes before a race is run as opinions are altered in light of new information 
such as the horses’ appearances and because the bookies would like to simultaneously 
balance their books to guarantee a profit no matter what horse wins, and maximize 
the number of tickets sold. This situation, from the bettor’s perspective, mirrors the 
jai alai situation, once extended to multiple outcomes, assuming that he has an indepen- 
dent estimate of the probability that each horse will win obtained by a handicapping or 
statistical procedure. 

A key feature of the team jai alai and racing situations is that the tickets once pur- 
chased are not marketable except perhaps at a substantial discount. Other situations 
share these features and we will describe two of them briefly here. 

Consider a company with substantial foreign accounts receivable at a future date. The 
standard way to hedge against possible devaluations is through a futures contract in the 
country’s currency. However, in many cases this is not possible because the currency 
does not have an active futures market or the time horizon is too long. The curren- 
cies of Italy, Thailand, and Turkey are examples of the former. Even for established 
heavily traded currencies such as the Euro and the Mexican Peso, such contracts will 
not cover a multiple year exposure. Negotiations with a bank might produce a spe- 
cial forward contract for part of the exposure. Such a contract would be difficult to 
sell except at a substantial discount. As time goes on, the company may add additional 
contracts to cover more of the exposure with the original or other banks. In terms of the 
jai alai formulation, one may think of the original exposure and any subsequent accounts 
receivable as bets on A and the covering as bets on B. 

Farmers often have fixed contracts for delivery of the crops from their acreage at a 
specified time. Both the price he or she will receive and quantity he or she will have 
available are likely uncertain. In a publicly traded commodity such as corn or wheat 
he or she could hedge against these uncertainties. However, active futures markets are 
not available for most commodities. Lettuce and raspberries are two such examples. 
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The farmer can consider his or her crop as bets on A and contracts he or she makes with 
other farmers of specific quantitites at fixed prices as bets on B. 

Some analyses of problems similar to these two examples using hedging arguments 
for static problems appear in Anderson and Danthine (1981), Feiger and Jacquillat 
(1979), McKinnon (1967), and Rolfo (1980), and for a two period problem in which 
additional information becomes available, see Baesel and Grant (1982). 
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WILLIAM T. ZIEMBA*: The evidence is very strong for the existence of a turn- 
of-the-year effect. The prices of small-capitalized stocks increase significantly 
relative to large-capitalized stocks on the first ten or so trading days of January. 
The effect seems to begin on trading day —1 where there is a tendency for a shift 
in sales of the small stocks at the asked rather than at the bid. Trading days +1 
to +4, on average, show enormous gains in the small stocks over the large stocks 
and this effect continues until mid-January. The total average difference between 
the smallest and largest deciles of stocks from —1 to +9 is on the order of 6 to 10 
percent. Studies supporting this effect use data over extremely long periods. For 
example, the original Rozeff-Kinney study concerned the period 1904-74. The 
period from 1962 to present has been especially thoroughly analyzed. One 
additional useful empirical study not referenced by Ritter is Smidt and Stewart 
[7]. 

Why does this effect occur and with such regularity? Jay Ritter’s excellent 
paper provides us with more insight into the understanding of this phenomenon 
with an analysis of the “parking-the-proceeds” hypothesis. Using a unique data 
set for the fifteen turns of the years from 1971 to 1985, he finds that the buy/ 
sell ratio of individual investors at Merrill Lynch explains 46% of the variance 
of the excess returns of small over big stocks in the first half of January. During 


* Faculty of Commerce and Business Administration, University of British Columbia, and Institute 
of Socio-Economic Planning, University of Tsukuba, Japan. Without implicating him, I would like 
to thank Jay Ritter for comments on an earlier draft of this discussion. 
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this period the small stocks gained 8.17% more than the big stocks on days +1 
to +9 or nearly 1% per day. Figure 1 visually shows the argument in a convincing 
manner. 

The “parking-the-proceeds” argument is based on three suppositions. First, when 
individuals buy stocks, they buy disproportionaly more small stocks than big 
stocks relative to institutional investors. Some supporting evidence is presented, 
and this hypothesis is reasonable since it is these very investors who have loaded 
up the large mutual and pension funds with the vast bulk of the market wealth, 
and these institutions do seem to concentrate on the large-capitalized stocks 
since in many cases they are judged on their performance relative to the S&P 
500. Moreover, the individual investors (especially those well healed enough to 
have accounts at Merrill Lynch) likely buy more small stocks in their personal 
accounts to diversify. Secondly, the price of these small stocks is affected by 
buying pressure—again reasonable as demonstrated by Stoll and Whaley [8] and 
others. Finally, individuals are net buyers in early January because of the proceeds 
remaining from December’s tax-motivated sales, and their purchases are large 
enough to move the market. The arguments that individuals sell in December to 
take losses, that they like to see the money in their account at the end of 
December before repurchasing in January, and that they receive pay and bonuses 
starting on day —1 that can be reinvested appear sound. Ritter’s data indicate 
that relative to the rest of the year individuals are net buyers in January but that 
in absolute terms they have been net sellers for each of the fifteen years in the 
sample. It is likely that the net selling is movement into mutual funds and away 
from individual accounts. Perhaps, as he suggests, in January these individuals 
are switching into small stocks and out of big stocks with capital gains. This 
needs to be checked. There is an analogy of the late-charging horse running 
faster to nip a front runner at the wire when in fact the charger is not accelerating 
at all. The horse is simply declerating at a slower rate. 

Obviously there are a number of factors that lead to the turn-of-the-year effect. 
The studies of Ariel [1] with spot data for 1963-1981 and Sick and Ziemba [6] 
with futures data from 1982-1988 show that throughout the year the small stocks 
outperform the big stocks in the trading interval —1 to +9 and particularly —1 
to +4. See also Keim and Smirlock [4]. The arrival of new money from income, 
bonuses, parked proceeds and other sources seems to fuel the net purchase of 
small stocks relative to big stocks in this period. 

The data also seem to support the hypothesis that the turn-of-the-year effect 
is strongest following a bear market. January 1988 following the October 1987 
crash was certainly a strong example of this. 

Why does the market not respond to this effect to eliminate it as it should in 
an efficient market? In fact, at least for the turn of the year, it does seem to do 
this in the futures markets. Clark and Ziemba [2] have shown that the futures 
price of the Value Line Composite begins to anticipate the turn-of-the-year effect 
around the middle of December. Moreover, the extent of this anticipation is 
capturing more and more of the expected gain of the turn-of-the-year effect as 
information about the effect becomes more widely known partly through trade 
books like Haugen and Lakonishok [3] and Ziemba [11]. The expected gain in 
the futures market of the Value Line Composite over the S&P 500 in the mid- 
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December to mid-January period has averaged about 2 to 3 percent in the past 
10+ years, with about two thirds of this being anticipated. 

I am in agreement with Ritter that institutional factors such as he discussed 
and the flow of funds at the turns of months and particularly at the turn of the 
year are among the main causes of the bidding up of the small stocks at these 
times. It is very useful to have reliable risk-return equilibrium relationships to 
explain asset prices. Even the substantial increase in risk during January found 
by Tinic and West [9] and Ritter and Chopra [5] does not explain the effect. 
The latter paper also concludes that risk is rewarded according to the CAPM 
only in January and then only for small stocks. Otherwise, this popular model 
does not explain the risk-return relationship. 

Theoretical studies such as Williams’ [10] asymmetric-information model, 
using better-informed inside traders and tax sellers that yields a bounce back of 
prices in January once the tax selling is completed and the risk premiums fall 
back in line, will be useful to piece together the missing 54% of the explanation. 
However, it does seem that the turn-of-the-year effect is caused by a number of 
factors coming together at one time and that they are largely institutional. Hence 
it will be hard for any one model to be the sole explanation of the effect. 
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1 


Introduction 


This paper explores an interesting question—whether January equity returns 
forecast the direction of the returns in the rest of the year. Yale Hirsch (1986) 
calls this the January barometer. The supposition is that: 

If the market rises in January, then it will also rise during the rest of the year, 
but if it falls in January, then there will be a decline during the rest of that year. 

Hirsch (1994) in his yearly “Stock Traders Almanac” actually reports results 
on the hypothesis “if January rises then that year rises.” While Hirsch includes 
January in the returns for the year, we believe that January returns should be 
excluded. Therefore, this study uses the italicized definition above, which 
excludes January returns and examines the rest of the year returns separately. 
This methodology provides for a clearer statistical test and, we believe, more 
closely represents the essence of the hypothesis. 

In another paper (Hensel and Ziemba (1995)), we investigated the US case 
more fully. Data on the S&P 500 during the 68-year period 1926-1993 strongly 
support the first part of the hypothesis, especially from 1940-1993. However, 
there was evidence that negative January returns did not have any predictive 
power for returns in the next eleven months. 

In this paper we investigate the January barometer’s predictive power in 
many worldwide equity markets including Australia, Austria, Canada, France, 
Germany, Japan, Switzerland, and the United Kingdom during the period 1970- 
1993. The results indicate that positive Januarys have good predictive power but 
that indices aggregated across various regions, such as Europe or Pacific, have 
stronger predictive power. As in the US, negative Januarys have no predictive 
power. 


2 The US Evidence 


We investigated the January barometer in a related paper using monthly total 
return S&P 500 data from Ibbotson Associates, for the 68-year period: January 
1926 to December 1993.! 

The results in Table 1, especially from 1940-1993, strongly support the first 
part of the January barometer hypothesis, namely that, if the market rises in 
January then it will also rise during the rest of the year. During this latter 54-year 
period (bottom two lines of Table 1), January returns correctly predicted the 
direction of the rest-of-the-year returns 75.9% of the time. When the return in 
January was positive, the rest of the year had positive returns 91.2% of the time. 
However, when January returns were negative, the rest of the year had positive 
returns in only 50.0% of the years. The 91.2% was significantly higher than the 
50.0% at the 1% level with a two-tailed test. 

The barometer predicted worse than chance during the 1926-1939 period 
when positive Januarys resulted in positive rest of the years only 37.5% of the 
time (Table 1) versus 57.1% positive rest of the years during this period. That is, a 
model which always predicted positive rest of the years would have been correct 
over 57% of the time during 1926-1939. By contrast, the January barometer 
correctly predicted positive rest of the years 37.5% of the time—a very poor 
showing. However, the barometer was a good predictor in each of the following 
five decades. These five decades had success rates greater than 80% for positive 
January returns. Two decades, the 1950s and 1970s, had success rates of 100% 
for positive Januarys. 


1 Please see Hensel and Ziemba (1995) for the full results. 
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Period 
1926-1993 


1926-1939 
1940-1949 
1950-1959 
1960-1969 
1970-1979 
1980-1993 


1940-1993 
N (1940-1993) 


2 The US Evidence 


Table 1 


Success of the January Barometer in the US and Average Gains 


and Losses for February through December, 1926-1993 


January Barometer Success Average Feb-Dec Loss Average Feb-Dec Gain 

AllJans. Jan. Up Jan. Down  JanDn,RstYrDn JanUp,Rst¥rDn JanUp,RstYrUp JanDn,RstYrUp 
66.2% 81.0% 42.3% -8.1% -25.4% 18.4% 15.5% 
28.6 37.5 16.7 -5.8 -33.9 30.2 26.2 
80.0 85.7 66.7 -7.2 -15.3 16.4 9.2 
90.0 100.0 66.7 -3.9 0.0 23.0 9.9 
70.0 83.3 50.0 -6.8 -11.2 13.6 11.3 
80.0 100.0 60.0 -15.5 0.0 13.5 12.1 
64.3 88.9 20.0 -0.6 -7.5 17.4 8.9 
75.9% 91.2% 50.0% -8.3% -11.3% 17.2% 10.2% 
54 34 20 10 3 31 10 


Source: Hensel and Ziemba (1995) 


The barometer, when positive, has also been a signal for the magnitude of the 
February to December gains. For example, during 1940-1993 when the January 
barometer was positive and suggested a gain during the rest of the year and there 
actually was a gain in these eleven months, it was significantly higher (at 17.2%) 
than the average return (10.2%) during years when the barometer forecasted a loss 
and there was a gain (bottom of right panel, Table 1). This difference was 
significantly positive with a one tail t test at a significance level of 0.15%. 

When the barometer was negative and suggested a weak rest of the year and 
there actually was a loss during the rest of the year, that loss averaged -8.3%. This 
return was not statistically different from the -11.3% when the barometer was 
positive and the forecast failed (bottom of middle panel, Table 1). Also, during 
this 54-year period, when January was negative the rest of the year was negative 
50.0% of the time. This compares to negative rest of the year returns in 24.1% of 
the years. Thus the barometer predicted more frequent negative returns than 
actually occurred, and was not a useful predictor when there were negative 
Januarys. 

We concluded that since 1940 the January barometer, when positive, has 
provided a statistically significant signal that both the probability that a gain will 
occur in the rest of the year is higher than average, and the size of that gain, if it 
occurs, will be above average. A negative January barometer has provided no 
information concerning the rest-of-the-year’s returns.” 

These results are consistent with the hypothesis, discussed, for example, by 
Schwadel (1988), that the returns in January are dependent upon current 
economic activity such as the Christmas sales period in December. If such 
economic activity is high then January stock prices as well as the rest-of-the- 
year’s prices likely will rise. 


2 The success of the January barometer is not the result of January returns, on average, being larger 
than other months. Over this 68-year period, three other months had average returns greater than 
January. A t test comparing the differences in mean returns, across all months, produced no significant 
results. 
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3 Worldwide Evidence 


To study the January barometer in other worldwide markets and regions we 
utilized the Morgan Stanley Capital International Indices with monthly total 
returns for the twenty-four year period 1970-1993. For individual countries we 
used local currency returns for this research. However, for indices aggregated 
across countries it seemed more appropriate to focus on a single currency; we 
used the US dollar for this purpose. 

We investigated the January barometer’s success in local currency returns for 
the nine individual countries: Australia, Austria, Canada, France, Germany, 
Japan, Switzerland, the United Kingdom plus the United States; and in US dollar 
returns for the four aggregated regions: Europe, Pacific, EAFE and the World 
Indices. 


Table 2 
Success of the January Barometer in Nine Countries 
and Four Aggregated Regions, 1970-1993 


MSCI Country Indices 


in Local Currency All Jans. Jan. Up Jan. Down 
Australia 66.7% 78.6% 50.0% 
Austria 50.0 58.3 41.7 
Canada 58.3 80.0 22.2 
France 54.2 64.7 28.6 
Germany 58.3 63.2 40.0 
Japan 70.8 81.3 50.0 
Switzerland 70.8 76.5 57.1 
United Kingdom 83.3 94.1 57.1 
United States 66.7 92.9 30.0 
MSCI Regional Indices 
in US Dollars 
Europe 79.2% 78.9% 80.0% 
Pacific 75.0 82.4 57.1 
EAFE 70.8 82.4 42.9 
World 70.8 -87.5 37.5 


In general, we have the same conclusion as in the US (with the 1926-1993 
data). If January is positive, then the rest of the year is positive a high percentage 
of the time. For example, this percentage averaged 74.6% for the eight countries, 
not including the US. The percentage was higher for the Europe (78.9%), Pacific 
(82.4%), EAFE (82.4%) and World (87.5%) Indices. Similar to the US, when 
January is negative, the signal that the rest of the year is also negative is very 
weak.? Hence, the results support the predictive power of the January barometer 
for positive Januarys but not for negative Januarys. 

The results in the aggregated regions support this and are somewhat stronger. 
However, there is one notable exception. For the MSCI Europe Index the 
barometer has worked for negative Januarys as well as positive Januarys 80.0% 
and 78.9% of the time, respectively.4 


3 The success percentage averages only 43.3% for the eight countries. A statistical test indicates that 
74.6% is higher than 56.7% {negative Januarys implying positive rest of the years, 56.7% = 
100% - 43.3%), with a two-tailed test at the 5% level. Moreover, 43.3% (negative January barometer 
success) is not statistically higher than 25.4% (positive Januarys implying negative rest of the years). 

The success at predicting negative rest of the years is in contrast to the other aggregated indices 
and the individual countries. This result could be due to the small number of negative Januarys (5), 
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3 Worldwide Evidence 


To investigate the January barometer further, we present in Tables 3a-m (in 
the Appendix) the results for the individual countries and regions. An explanation 
of Table 3 is provided in Table 3a. 

Some of the main conclusions by country and region for 1970-1993 are: 

a) Australia — the barometer worked similar to the US: in 11 of the last 12 
years with positive Januarys, the returns in the rest of the year were also positive; 
negative Januarys had no predictive power; when returns were positive in 
February to December they were higher when January was positive. 

b) Austria — the barometer did predict slightly better than chance for 
positive Januarys and slightly worse than chance for negative Januarys but these 
results were not statistically significant even at the 10% level. 

c) Canada — the barometer did work for positive Januarys and provided no 
information for negative Januarys. The results were similar to those for the US 
which is not surprising because the stock returns in the two markets are highly 
correlated and the currency exchange rate has relatively low volatility. 

d & c) France and Germany — the January barometer predicted poorly in the 
1970s but very well for the positive returns in the 1980s; in neither period did the 
signal predict the level of returns better than chance. 

f) Japan — the results were similar to the US with the barometer predicting 
the probability and size of the rest-of-the-year’s returns with positive Januarys 
and providing no information for negative Januarys. 

g) Switzerland — similar to France and Germany; the barometer predicted 
poorly in the 1970s and more accurately in the 1980s; the signal for positive and 
negative Januarys gave an accurate prediction of the size of the returns in the rest 
of the year. 

h) United Kingdom — for positive Januarys, the barometer had a very high 
level of accuracy (94.1% overall and 100% in the 1982-1993 period) and gave an 
accurate prediction of the level of returns. For negative Januarys, the barometer 
did not provide a useful prediction of the chance of negative returns in the rest of 
the year. However, the barometer did provide a useful prediction of the magnitude 
of the February to December decline. 

i) US — the 1970-1993 results reported here are similar to those from 
1940-1993 reported in Hensel and Ziemba (1995) and summarized in Section 2. 

j) Europe — the MSCI Europe Index has 14 countries which include the 
five studied here plus nine others. However, the five studied here contain the 
majority of the market capitalization and hence dominate the results. The January 
barometer for the aggregated index for Europe was accurate for positive as well as 
negative Januarys (however, there were only 5 negative Januarys), and both gave 
accurate signals regarding the size of the returns. 

k) Pacific — the Pacific index is dominated by Japan and the results show 
this; the barometer predicted well both the probability and size of the rest-of-the- 
year’s returns for positive Januarys and provided no information for negative 
Januarys. 

1) EAFE — the MSCI Europe, Australia, and Far East Index results are 
similar to that of the Pacific with the barometer predicting well both the 
probability and size of the rest-of-the year’s returns for positive Januarys and 
providing no information for negative Januarys. 

m) World — the MSCI World Index is approximately the US plus EAFE 
both of which had the same result; the barometer predicted well both the 
probability and size of the rest-of-the-year’s returns for positive Januarys and 
provided no information for negative Januarys. 


possible effects introduced by measuring returns is US dollars, or through a reduction in the noise of 
each country forecast when they are aggregated into the regional index. 
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4 Final Remarks 


We do not have an adequate explanation as to why the January barometer seems 
to have predictive power when January returns are positive. There has been some 
conjecture that the January barometer might really be a first month of the 
corporate fiscal year phenomena. The most common corporate fiscal year in the 
US is the calendar year. Therefore, in countries where fiscal years and calendar 
years differ, if this conjecture is correct, we might expect to see the first month of 
the common corporate fiscal year predict the following 1! months returns better 
than January. 

We examined this possible explanation for the January barometer for three 
countries—Australia, Japan, and the UK. In Australia, many companies start their 
fiscal year in July, ending the following June. Japan and the UK have fiscal years 
from April through March. All three cases, using the first month of the fiscal year 
as an indicator for the rest of the year, performed worse than using January as the 
indicator. These results suggest, at least for these three countries, that corporate 
fiscal years were not the major factor contributing to the success of the January 
barometer. 

The actual reason why the predictive ability occurs is probably a combination 
of factors. Besides the one discussed above, and the Christmas business 
hypothesis discussed in the text, the very fact that January returns are usually high 
and are expected to be so is another possible reason. That may be why the 
barometer seems to predict well for positive Januarys (the expected result) and 
provide no information for negative Januarys. Still, the January barometer is an 
interesting and useful concept and indicator for stock investors in the US and 
other worldwide markets. 
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6 Appendix 


6 Appendix 


Table 3 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 


a) Australia Index January Barometer 
Success -Success percentage of the January barometer for all 
Period All Jans. Jan. Up Jan. Down] Januarys, Januarys with positive returns, and Januarys 
with negative returns. 
-Frequency of yearly observations. 


1970-1993 66.7% 78.6% 50.0% 
N 24 14 10 : 
-Percentage of years with positive January returns. 
% Years with positive January 383 -Percentage of years with positive returns for the year. 
% Years with positive Returns 66.7 mined o years with positive rest of the year 
% Years with positive Feb-Dec 66.7 retumis (Feb DEC): 
1970-1993 


Average Feb-Dec Loss -These returns provide data on the magnitude of losses, 
JanUp, RstYrDn when there was a loss in the rest of the year (Feb-Dec), 
JanDn, RstYrDn for positive and negative January returns. 


Average Feb-Dec Gain -These returns provide data on the magnitude of gains, 
JanUp, RstYrUp when there was a gain in the rest of the year (Feb-Dec), 
JanDn, RstYrUp for positive and negative January returns. 
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Table 3 (Continued) 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 


b) Austria Index January Barometer Success 


d) France Index January Barometer Success 


Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 

1970-1993 50.0% 58.3% 41.7% 1970-1993 54.2% 64.7% 28.6% 
N 24 12 12 N 24 17 7 
% Years with positive January 50.0 % Years with positive January 70.8 
% Years with positive Returns 62.5 % Years with positive Returns 70.8 
% Years with positive Feb-Dec 58.3 % Ycars with positive Feb-Dec 66.7 
1970-1993 1970-1993 

Average Feb-Dec Loss Average Feb-Dec Gain 
JanUp,RstYrDn -3.1% JanUp,RstYrDn -6.3% 
JanDn,RstYrDn -1.3% JanDn,RstYrDn -3.0% 

Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 15.3% JanUp,RstYrUp 12.9% 


JanDn,RstYrUp 


c) Canada Index January Barometer Success 


10.5% 


JanDn,RstYrUp 


e) Germany Index January Barometer Success 


24.7% 


Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 

1970-1993 58.3% 80.0% 1970-1993 58.3% 63.2% 40.0% 
N 24 15 N 24 19 5 
% Years with positive January 62.5 % Years with positive January 79.2 
% Years with positive Returns 75.0 % Years with positive Returns 70.8 
% Years with positive Feb-Dec 79.2 % Years with positive Feb-Dec 62.5 
1970-1993 1970-1993 

Average Feb-Dec Loss verage Feb-Dec Loss 
JanUp,RstY¥rDn -2.6% JanUp,RstYrDn -3.8% 
JanDn,RstYrDn -1.6% JanDn,RstYrDn -10.4% 

Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 10.8% JanUp,RstYrUp 12.0% 
JanDn,RstYrUp 12.4% JanDn,RstYrUp 16.7% 
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6 Appendix 
Table 3 (Continued) 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 
f) Japan Index January Barometer Success h) United Kingdom Index January Barometer 
Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 
1970-1993 70.8% 81.3% 50.0% 1970-1993 83.3% 94.1% 57.1% 
N 24 16 8 N 24 17 7 
% Years with positive January 66.7 % Years with positive January 70.8 
% Years with positive Returns 75.0 % Years with positive Returns 83.3 
% Years with positive Feb-Dec 70.8 % Years with positive Feb-Dec 79.2 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 
JanUp.RstYrDn -2.8% JanUp,RstYrDn -0.1% 
JanDn,RstYrDn -8.1% JanDn,RstYrDn -12.9% 
Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 16.3% JanUp,RstYrUp 19.0% 
JanDn.RstYrUp 11.5% JanDn,RstYrUp 7.1% 


g) Switzerland Index January Barometer Success i) United States Index January Barometer 
Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 
1970-1993 70.8% 76.5% 57.1% 1970-1993 66.7% 92.9% 30.0% 
N 24 17 7 N 24 14 10 
% Years with positive January 70.8 % Years with positive January 58.3 
% Years with positive Returns 70.8 % Years with positive Returns 79.2 
% Years with positive Feb-Dec 66.7 % Years with positive Feb-Dec 83.3 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 

JanUp,RstYrDn -3.6% JanUp,RstYrDn -0.4% 
JanDn,RstYrDn -8.4% JanDn,Rst¥rDn -4.5% 

Average Feb-Dec Gain Average Feb-Dev Gain 
JanUp,RstYrUp 14.4% JanUp,RstYrUp 14.5% 
JanDn,Rst¥rUp 5.0% JanDn,RstYrUp 7.2% 
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Table 3 (Continued) 
Summarized Results on the January Barometer for Nine Countries 
in Local Currency and Four Regions in US Dollars, 1970-1993 


J) Europe Index US$ January Barometer Success 1) EAFE Index US$ January Barometer Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan.Up Jan. Down 
1970-1993 79.2% 78.9% 80.0% 1970-1993 710.8% 82.4% 
N 24 19 6 N 24 17 
% Years with positive January 79.2 % Years with positive January 
% Years with positive Returns 70.8 % Years with positive Retums 
% Years with positive Feb-Dec 66.7 % Years with positive Feb-Dec 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 
JanUp,RstYrDn -1.2% JanUp,RstYrDn -2.4% 
JanDn,RstYrDn -7.8% JanDn,RstYrDn -5.3% 
Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 15.8% JanUp,RstYrUp 18.5% 
JanDn,RstYrUp 4.0% JanDn,RstYrUp 6.4% 
k) Pacific Index US$ January Barometer Success m) World Index US$ January Barometer Success 
Period All Jans. Jan. Up Jan. Down Period All Jans. Jan. Up Jan. Down 
1970-1993 75.0% 82.4% 57.1% 1970-1993 70.8% 87.5% 37.5% 
N 24 17 7 N 24 16 8 
% Years with positive January 70.8 % Years with positive January 66.7 
% Years with positive Returns 70.8 % Years with positive Returns 75.0 
% Years with positive Feb-Dec 70.8 % Years with positive Feb-Dec 79.2 
1970-1993 1970-1993 
Average Feb-Dec Loss Average Feb-Dec Loss 
JanUp,RstYrDn -3.0% JanUp,RstYrDn -2.2% 
JanDn,RstYrDn -7.5% JanDn,RstYrDn -4.2% 
Average Feb-Dec Gain Average Feb-Dec Gain 
JanUp,RstYrUp 22.6% JanUp,RstYrUp 14.5% 
JanDn,RstYrUp 10.9% JanDn,RstYrUp 5.1% 
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Occupational Nostalgia #3 
Bill Ziemba: 


lhad been focusing my research 
on stochastic programming, that’s 
optimization with scenarios and 
portfolio theory, Earlier | applied 
these ideas to horseracing and 
devised a now well-known sys- 
tem to beat the place and show 
markets, The idea was to use probabilities from a simple 
market, namely win, to price more complex markets like 
place, show, and exotics. And bet with the Kelly crite- 
rion. That’s led to a number of books that the racetrack 
hedge fund syndicates use, See Amazon for titles, | got 
an invitation to go to Japan and after an interview con- 
structed a deal to be the first Yamaichi Visiting Professor 
of Finance and consultant two days of the week to the 
Yamaichi Research Institute. The Leaching was good, 
doing futures and options and other financial subjects. 
The students were from the University with a number 
commuting up to the University of Tsukuba, which is a 
National University. | asked for the consulting to be in 
two study groups on Thursdays and Fridays on stock 
market anomalies end stock market crashes. It was 1988 
and they were flush with money, My wife Sandra, a keen 
economist who knows real economics, believed that as 4 
Creditor nation they were in trouble and would figure out 
a way to lose the money. She was right, and in the end 
they lost 5 trillion on stocks and 5 trillion on land, both 
items they already owned, So they simply pushed up their 
Own assets with only 3 percent of their assets abroad 

~ our book Power Japan details this. In the study groups 
we found a good crash model that predicted 12 out of 12 
of the 10+ percent crashes inthe Japanese market from 
1948-1988. These were interest rate crashes relative to 
esmings yield, so | had the model when the long bond 
fate minus the reciprocal of the price earnings ratio was 
too high. Then in the fight for the asset allocation money 
Went to bands and stocks fell. | called this the bond 


stock crash measure, And | did publish it in 1991 in the 
book Invest Japan, With the Japanese market going up 
221 times in yen and 550 times in dollars in this 40-year 
period there were 20 such 10+ percent declines. So the 
mode! did not predict all crashes but it did work when the 
signal went into the danger zone. The idea was that if it 
entered the danger zone then it was very likely that a 10+ 
percent decline would occur in the next year. | learned 
this by looking at the 1987 worldwide stock market crash 
~ there the market went into the danger zone in April 
then fell in October. So the market can continue rallying 
but the signal indicates that a big decline will occur. The 
measure called nicely the 2000 US crash, with the signal 
showing up in April 1999 a year before the start of the 
decline. Recall there was a drop in April then a rally back 
toa similar high in August 2000 then a decline of big 
proportions into 2001. Then stocks fell but earnings fell 
more in early 2002, so the mode! suggested another crash 
that occurred in 2002 with the market falling 22 percent 
and 12 percent of this in the July to September quarter. 
That's in the USA. | might add that this model predicted 
the 2008 crashes in China and Iceland because they were 
interest rate crashes but it did not predict the 2007-2010 
trouble period which was not a high interest rate crash. 
This measure ina ratio logarithmic form is known in the 
trade as the FED model from a 1996 report. Of course, it’s 
in our 1991 book, so | hope someday to get some credit 
here. Getting back to Japan, | noted at the end of 1989 
that Lhe market was in the danger zone. Getting a close 
prediction ona crash of an expensive overheated market 
is tough. Ask George Soros, who shorted Japan in 1988 
and lost a lot. But then the bond-stock model was not 

in the danger zone as stack prices were high but interest 
fates were not high enough to reach the danger zone. 
But! was not consulting for Soros, rather for Yamaichi 
The firm was nice to me, and my family. My now rather 
famous daughter Rachel, who has become a star analyst 
working with Nouriel Roubini, was nine then and went to 
Japanese public school (in Japanese), Rachel has many 
talents and languages are one of them, so she picked up 


the Japanese fast ~ some from after-school play with 
other youngsters, My wife had a nice teaching job and 
we were taken to nice dinners and other good events. | 
was taken for fugu and golf ~ yes, | was able to come last 
but not much below the 3rd finisher - so the protocol 
was handled okay. Getting back to the crash model - | 
decided to tell them about this through lishi, a rather 
good helper with perfect English (MBA from Yale) and 
perfect Japanese. He had helped me with the calcula - 
tions and wes sort of a de facto leader of the students. 
While | had given a number of talks to large audiences 
and had a whole year of talks in English-Japanese with 
Mr. Okada, one of the higher ups, and they supported 
my research ~ including a rather good 30-factor model 
of the Japanese first section — there was no way they 
would believe me and lishi that the market heading to its 
peak of 38,916 was way into the danger zone. It’s too bad 
they did not listen, as the market fell 56 percent starting 
‘on the first trading day of January 1990. And five years 
later Yamaichi Securities went bankrupt — quite a fall for 
the world’s sixth largest brokerage firm in 1989, There 
was one more interest rate rise in late December 1988 
in the governunent’s futile effort to deal with the bubble 
economy, In their poor response the government raised 
interest rates eight more months into August 1990 - and 
that in my opinion had a lot to do with the 20-year bad 
economy that they have had. | had found that most of the 
anomalies found in US markets were there in Japan, such 
as the turn of the month effect, etc. It was said that Japan 
was different. What was different was not the economics, 
but the culture. My Japan experience was a good one and 
led to three books and a number of articles, a rather good 
Nikkei put warrant arbitrage with fellow Wilmott colum- 
nist Ed Thorp, and a nine-year consulting to the research 
department relationship with the Frank Russell Company 
in Tacoma, Washington including the Yasuda Kasai model 
which | designed for them and which ushered in the era of 
multi-period stochastic programming asset-liability mod- 
els; for interested parties, see especially the 2007 ALM 
handbook | edited for North Halland with Stavros Zenios. 
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U.S. Bears’ Bets May Roil Japan’s Turmoil 


By BARBARA DONNELLY 
And MICHAEL R. SESIT 
Staff Reporters of 

THE WALL STREET JOURNAL 

U.S. investors on Nikkei put 
warrants, a relatively new invest- 
ment that reaps big profits if 
Japanese share prices plunge, have 
been sitting pretty through the Tokyo 
stock market’s recent upheavals. 

They also may be part of what is 
causing that turmoil. 

This is because the securities 
firms and banks that devised the 
Nikkei puts, such as Salomon Inc. 
and Bankers Trust New York Corp., 
often hedge them with complex 
trading strategies that increase 
pressure on Japanese stocks when 
prices are already falling. 

In effect, these institutions have 
revived the computer-driven “port- 
folio insurance” strategies that were 
a primary factor behind the U.S. 
stock market crash in October 1987. 

So far this year, the benchmark 
Nikkei index of 225 leading stocks that 
trade on the Tokyo Stock Exchange 
has plunged nearly 27% from its 
record high at the end of last year. On 
Monday, the Index fell 750.74 points, 
or 2.6%, to close at 28463.18. 

The fundamental reasons for the 
drop in Japanese stock prices are well 


known: rising Japanese interest rates, 
accelerating monetary growth, fears 
of higher inflation and a general belief 
that asset prices in Japan — both stocks 
and real estate — were overvalued. 
Waning confidence in  Japan’s 
monetary authorities has merely 
added to the investors’ anxieties. 

Now a growing number of people 
believe that portfolio insurance also 
played a part. While the Nikkei put 
warrants weren’t the cause of the 
Tokyo stock market’s drop, they say, 
the computerized hedging programs 
backing them exacerbated the decline 
once it started and added to the 
market’s volatility. 

“These puts aren’t taken lightly 
of the Ministry of Finance or any of 
the groups in Japan,” says a Canadian 
investment banker who worked on 
some of Bankers Trust’s warrants. 
“There were a lot of negotiations with 
the ministry. We got calls at two in 
morning. It was very sensitive.” 

In Tokyo, a Finance Ministry 
official says: “We can imagine [the 
puts have] some effect, but no one can 
measure them. We cannot trace how 
these products have affected the 
market. We intend to study it further.” 

Just how much the Nikkei puts 
have affected the Tokyo market is 
probably impossible to qualify. Some 


of the effect is muted by rules that 
require futures trading to close down 
once prices move 10%. 

Nonetheless, “The psychological 
effect is greater than the actual 
transactions,” a senior trader at one 
of Japan’s big securities firms says. 
“The problem is that [the big 
Japanese brokers] can’t do it and 
can’t explain it to their clients.” 

Japanese securities firms, which 
lack the technology and the expertise 
to do much computerized program 
trading, have criticized U.S. firms in 
recent months for disrupting the 
market by using such techniques. 
While the Americans don’t deny they 
use computerized trading strategies, 
they vehemently deny that their 
practices contribute to the Tokyo 
market’s violent fluctuations. 

Four Nikkei put warrants have been 
listed on the American Stock Exchange 
since early January. The first was 
offered by Denmark and underwritten 
by Goldman, Sachs & Co.; the others, 
by Bankers Trust with one and Salomon 
with two, quickly followed. 

The  Annex-listed warrants 
entitle investors to a payment in U.S. 
dollars, if the Nikkei stock average 
falls below the exercise or “strike” 
price specified in the war- 
Please Turn to Page C10, Column 5 


Bearish Bettors in U.S. May Be Partly 
Behind Upheavals in Tokyo 


Continued From Page C1 
rants’ prospectuses. Trading in the 
Annex puts has been active, at times 
accounting for as much as 40% of 
the Annex’s daily volume. 

Other Nikkei puts have been 
trading in Canada and London for 
more than a year, while private 
placements of Nikkei put options 
have been available to U.S. and other 
institutional investors since 1988. 

Combined, the publicly listed U.S. 
and Canadian put warrants represent a 


Source: The Wall Street Journal, 1990 


roughly $2 billion bearish bet against 
the Nikkei index, figures William 
Ziemba, a professor of management 
science at the University of British 
Columbia. Nobody knows for sure the 
value of the privately placed puts, but 
investment bankers involved in some 
private deals say the total could be 
about to times the listed warrants. 

To hedge the exposure on the 
Nikkei puts, the securities firms and 
banks use a few different strategies, 
including portfolio insurance, or 


“dynamic hedging,” as it is called 
now. This isn’t really insurance; 
rather, it is a trading strategy that sells 
progressively more stocks as the 
market declines and buys them back 
as prices rally. The goal is to limit 
losses in a falling market, but not 
leave the investor too far behind when 
prices are gaining. Without these and 
other strategies to hedge the puts, 
Issuers would have risked losing 
hundreds of millions of dollars in the 
recent plunge in Japanese stock prices. 
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NIKKEI PUT OPTIONS GOOD BUY 
FOR FOREIGN FUND MANAGERS 


Why foreign investors are finding Tokyo an attractive place to insure their global portfolios. 


ne of the great ironies in the 
Q esate run-up of 

Japanese stocks is that each 
rise makes it easier for foreign inves- 
tors to bet that the market might one 
day start back down. 

This has been made possible, in 
part, by infinitely bullish Japanese 
players who don’t shy away from 
taking the other side of the bet from 
foreign investors. 

The foreigners are buying put op- 
tions to sell the Nikkei 225 stock 
average. A put is an option to sell ata 
specified price within a specified 
period. Buyers pay a fixed premium 
and profit by each dollar that the Nik- 
kei falls below a certain level, or strike 
price. If the index hasn't fallen to the 
strike price when the option expires, 
the premium is forfeited. 

The draw for large investors is two- 
fold. First, the puts provide a cheap 
way to bet against the Tokyo market. 

For about 6 percent of the Nikkei, 
or close to ¥2,000, one can buya three- 
year option with a stock price equal to 
the index's current level. If the index 
falls just six percent the investor 
breaks even. 

Second, they are less risky than a 
second way of betting against the 
market called short selling; selling 
borrowed securities into the marketin 
the hopes of a price drop. 

One reason for the puts’ low cost is 
the large number of Japanese cor- 
porations and individuals willing to 
take the other side of the trade. 

According to Koichi Kozu, assistant 
manager for stock index futures and 
options with Nomura Securities Co., 
the largest buyers of call options are 
Japanese trading houses. Japanese 
corporations and private individuals 
are also large buyers. 

There are several ways, other than 
bullish sentiment, to explain the 
seemingly unlimited Japanese ap- 
petite for calls. 

Many Japanese companies, includ- 
ing major securities houses, book the 
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premium, now about 20 percent of 
the option’s exercise price, as profit. 
Very few investors will miss an easy op- 
portunity to increase yield on their 
portfolios, according to Kozu. 

In addition, David Baran, vice presi- 
dent of derivative products with 
Shearson Lehman Hutton, said the 
Japanese investors, including major 
securities houses, take the premiums 
and plow them back into the market. 

“They (Japanese investors) are long 
and, they're going to be long forever,” 
Baran quipped. 

Another attractive feature of the op- 
dons market is its relatively low mar- 
gin requirements. With margin 
requirements in many foreign 
markets rising in response to concern 
over the Crash of 1987, Tokyo remains 
a safe port for leverage. 

A final explanation is that the level 
of volatility in New York is nearly 
double that in Tokyo. In volatile 
markets, options sellers demand 
higher premiums to compensate for 
the increased risk of being forced to 
sell their stock or compelled to pur- 
chase additional shares. 

The biggest foreign buyers have so 
far been money managers and high 
net worth individuals. The put op- 
tions are only sold in private place- 
ments and large dollar amounts. 


The money managers say the puts 
are a cheap hedge for global 
portfolios. If the Japanese market 
sneezes and the rest of the world 
catches cold, these money managers 
will have their bases covered. 

“It's a good, cheap way of insuring 
our portfolio,” said Tiger Fund's 
Julian Robertson (see interview). 

Moreover, since losses on options 
are limited, unreformed Tokyo bears 
can wait for the bubble to burst from 
a safe distance. 

The second, riskier way of betting 
against the Tokyo market, selling 
stock short, cannot be done in Tokyo, 
because the Tokyo Stock Exchange 
charges exorbitandy high fees for 
loaning securities to member firms to 
discourage the practice. 

Most of this business is done from 
the London subsidiaries of foreign in- 
vestment banks. Shearson's book of 
Japanese equities loaned is “huge,” 
Baran said. This indicates that there 
are still many people around the 
world who believe the Nikkei is due 
for a fall. 

When asked if he thought Japanese 
investors took part in the short selling 
to hedge their portfolios, Baran said 
he doubted it. 

On the contrary, Baran said a 
favorite trading technique among 
Japanese investors is so-called front- 
running, in which an investor uses fu- 
tures contracts to purchase stocks with 
money he does not yet possess. 

A big question for a buyer of a put 
is what happens if your worst fears are 
realized and the Nikkei does collapse. 

First there is the risk that the yen will 
collapse with it. Another risk is that in 
a general market collapse there is no 
guarantee that the Japanese counter- 
party will be able to deliver on the 
trade. 

But then as one American banker 
said to me, if Nomura goes under, 
we'll have a lot more to worry about 
than options contracts. 

(Stephen Lukow) 
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Buying Stock? Consider Turn-of-the-Month Effect 


HEARD 
ON THE 


STREET 
By Michael Gonzalez 
Staff Reporters of 
THE WALL STREET JOURNAL 

You’ve heard of the “October effect,” 
when the stock market tends to go down. 
And of course there is the “January effect,” 
when it tends to go up. Not to mention the 
axiom: “Buy on Friday and sell on Monday” 
(or is it the other way around?). 

Now Frank Russell Co., a pension-fund 
consulting firm in Tacoma, Wash., is 
advancing the “TOM effect,” short for the 
turn-of-the-month, when Russell says stoc! 
prices tend to go up because that is when 
investors receive cash to put into the market. 

The idea isn’t completely original. 
Earlier studies by analysts Norman Fosback, 
Yale Hirsch, and Robert Ariel have come uj 
with similar results. But the new study, by 
Russell analyst Chris R. Hensel and William 
T. Ziemba of the University of Britis! 
Columbia, is one of the most extensive, 
covering the Standard & Poor’s 500-stoc! 
index and its predecessors from 1928 to 1993. 

Messrs. Hensel and Ziemba found that 
the turn-of-the-month period, defined as the 


As the Month Turns, the DJIA Exceis 
Average percentage price change in the DJIA per day for every month, 1915-1994 
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Source: Girinyl Associates —— $ 


last trading day of the month through the 
first four trading days of the next month, 
generated an average daily return six times 
that of the daily average for the rest of the 
days of the month. 

“It appears that this happens because of 
the flow of funds,” says Mr. Hensel. 
“There’s a number of things that happen at 
the end of the month — people get paid, 
dividends are paid, principal payments,” he 
says, as well as corporate contributions to 
pension or retirement funds. 

The Russell study found average daily 
returns for the turn-of-the-month period were 
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0.1236%, as opposed to an average daily 
return of 0.0186% during the 65-year test 
period. The probability of that happening, 
say the authors, is below one in a thousand, 
making the phenomenon statistically 
significant. 

The study follows daily closing 
prices on the Standard & Poor’s 500-stock 
index 1957, when the 
value-weighted index grew to 500 stocks, 
and on its predecessor, the S&P 90-stock 
composite, since 1928. 


since 


Mr. Hensel says psychology plays a 
Please Turn to Page C2, Column 3 


Turn, Turn, Turn: To Every Stock Price There 
Is a Reason for the Month-to-Month Price Jump 


HEARD 
ON THE 


STREET 
Continued from Page C1 
part, too. “People tend to aggregate things 


according to the calendar,” he says. “People 
observe what has happened to a stock during 
a month and then make decisions whether to 
invest or not.” 

Birinyi Associates, a Greenwich, Conn., 
firm that monitors stock movements, finds 
similar results for the period 1915 to 1994. 
The average daily price change in the Dow 
Jones Industrial Average surges just before 
month-end, when it jumps from minus 
0.02% on day 28 to plus 0.01 on day 29, 
0.12% on day 30, peaking at 0.19% on day 
two of the next month, then falling to minus 
0.01% on day seven. The biggest loser, on 
average, is day 19. 

One potential weakness in both the 
Russell and Birinyi data is that they 
represent price change only, and don’t count 
the dividends 
indexes would receive. One Birinyi analyst 
said that counting the dividends daily, which 


that an investor in such 


only became possible within the past few 
years, the 
turn-of-the-month effect. 


wouldn’t cancel out 


Source: The Wall Street Journal, 1995 


The Russell study found evidence of 
other “effects” known to Wall Street, such as 
the January effect, which analysts have long 
attributed to the halt in year-end selling for 
tax reasons by individual investors as well as 
of new money from corporate 
pension and retirement funds. 


inflows 


The October effect — when stocks go 
down — also turned up in the Russell study, 
with the October average daily return during 
the TOM period the lowest of any month. 
Thomas M. Keresey, chairman of Palm 
Beach Investment Advisers in Palm Beach, 
Fla., says one explanation for the October 
effect is that money managers tend to sell 
stocks during that month to raise cash before 
the end of their fiscal year, typically Oct. 31, 
in order to make once-a-year dividend 
payouts. 

But Mr. Keresey says he ignores these 
kinds of effects. “It doesn’t make any 
difference to me. When I buy a stock I buy it 
because it’s going to go to a certain price 
from a long term perspective,” he says. 

Several traders and money managers 
say they have heard of the TOM effect, but 
note that it is difficult to take advantage of it. 
“You can’t predict what’s going to happen to 
a particular stock,” says Anthony Conroy, 


vice president of global investment 
management at Bankers Trust. 

The Russell study’s authors take it a bit 
further and suggest that traders attempt to 
move into stocks or stock futures just before 
the turn of the month, and back into cash 
However, as the study 


acknowledges, suggestion doesn’t 


afterward. 
their 
include the impact of transaction costs, 
which would likely be considerable. 

Mr. Hensel even tried doing something 
similar — shifting from stocks to cash and 
back — with the S&P index funds and 
money-market funds at the Vanguard mutual 
fund group. But only until the Valley Forge, 
Pa., large mutual-fund family caught on. “I 
was overtrading according to their rules, so 
they basically asked me to stop,” he says. 

However, he says that the average 
investor can still time a monthly mutual-fund 
contribution check, putting it in the mail later 
in the month, say around the 20th, so it will 
go into the fund just before stock price 
returns historically turn up. 

But Jeffrey Rubin, an analyst with Birinyi 
Associates, is sceptical of doing even that much. 
The phenomenon is “obviously true,” he says. 
But he adds: “This is more for the short-term 
trader that wants to have an edge on the market.” 
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MAKING DOLLAR-COST AVERAGING EVEN MORE 
PROFITABLE 


THE TRIED-AND-TRUE DISCIPLINE CALLED DOLLAR-COST AVER- 
AGING works because it is so simple: Anyone who puts a fixed dollar amount 
into stocks or mutual funds at regular intervals will buy more shares when the 
price per share is lower and buy fewer when the price per share is higher, lead- 
ing to a lower overall cost per share. But recent studies show that it pays to 
add just a little complexity. Making those regular purchases at the right time 
of the month can make dollar-cost averaging work even better. 

According to Yale Hirsch, editor of The Stock Trader’s Almanac, over the past 
six decades stocks have done 2.5 times better in the first half of the month 
than in the second half. A 1996 study by Chris Hensel, client executive at the 
Frank Russell Company, and William Ziemba, a professor at the University of 
British Columbia, found an even bigger difference, with average returns of 0.07 
percent in the first half of the month and negative 0.02 percent in the second 
half. 

According to Hensel and Ziemba, the biggest returns come between the last 


trading day of one month and the fourth trading day of the following month. 


Daily returns then average 0.12 percent versus 0.02 percent daily for the entire 
month. 

Taking advantage of this “turn of the month” effect, Hensel says, by investing 
on the third- or second-to-last day of the month. For example, a dollar-cost 
averager would time his check to arrive by September 27. His dollars would 
then be invested when shares are relatively cheap and before prices climb. 
“The turn of the month is when the lion’s share of the gains has happened over 
time,” says Hensel, “so it makes sense for investors to try to be invested by 
then.” 


—J.B. 
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Investment Results from Exploiting Turn-of-the-Month Effects 


1 Introduction 


Investment advisors have argued that US stocks have substantial rises at the turn 
of the month.’ The turn of the month (called TOM) is defined to be the last 
trading day of the previous month and the first four trading days of the new 
month. In this Commentary, we examine the monthly return patterns in the S&P 
500 from 1928 to 1993 to investigate this phenomenon. We then examine various 
investment strategies that could practically exploit the turn-of-the-month effect. 
The results are of potential use to institutional investors concerned with timing of 
stock purchases and sales and for long-term investors who utilize the S&P 500, or 
similar benchmarks, for performance measurement of portfolios. 

The intense investigation of seasonal anomalies in US and other security 
markets brings up the issue of data mining.’ Merton (1985), Black (1986, 1992), 
and Lo and MacKinlay (1990) in particular discuss the dangers of finding what 
appear to be genuine anomalies but may simply be random data variations. 
Perhaps the best remedy against data snooping is new data and convincing 
reasons for the effects. 

One explanation for the high returns at the turn of the month is the 
considerable cash flows coming into the stock market during this time.’ Many 
salaries, dividends, principal payments, and debt interest are payable on the last 
and first days of the month. There are also institutional, corporate, and pension 
fund purchases during the turn of the month. These cash flows vary by month and 
lead to higher average returns in January, which has the highest cash flow.’ 

The data used in this study consists of the daily closing prices of the S&P 500 
stock index for the 65-year period from February 1928 to June 1993, and was 
supplied by Data Resources Incorporated. The S&P 500 is a value-weighted index 
of large capitalization US stocks. Since March 1957, it has consisted of 500 large 
stocks weighted by market value (price times number of shares outstanding). 
Prior to then it consisted of 90 large stocks. Index futures contracts on the S&P 
500 have been traded since 1982 on the Chicago Mercantile Exchange. Futures 
options on the S&P 500 are traded at the Chicago Mercantile Exchange, and 
options are traded at the Chicago Board Options Exchange. The S&P 500 is an 
appropriate index for studying these anomalies, since it is value-weighted, broadly 
diversified and representative of the market portfolio, and has a very long history. 
Additionally, the S&P 500 is often used in passive portfolio management to 
represent the market and is a common performance benchmark for US stock 
portfolios. 


2 The Return Patterns in the S&P 500 


To avoid data-snooping biases, we have defined the turn of the month as Ariel 
(1987) did in his study of the cash markets from 1963-1981. 


Trading 
Category Acronym Days 
Turn of the month TOM -1 to +4 
Second week SW +5 to +9 
First half of the month FH -1 to +9 
Rest of the month ROM +9 to -2 


Thus, TOM is the last trading day of the previous month and the first four trading 
days of this month. Analyzing the S&P 500 data from 1928 to 1993, we found 
that the five trading days during TOM had very high returns. Three of these five 
days had returns significantly above average. Moreover, all of the rest of the 
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2 The Return Patterns in the S&P 500 


month had returns near or below the average and none of these days had returns 
significantly above average. 


Figure 1 
Average Daily Returns in the S&P 500 Cash Market 
by Trading Day of the Month 
(February 1928 to June 1993) 
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Figure 1 shows the average return pattern by trading day, and average daily 
return. The TOM (days -1 to +4) had mean returns significantly above average. 
However, over the 65-year period studied here, the five-day period -2 to +3 had 
even higher returns. To examine the consistency of returns by day, we calculated 
the mean daily return and standard deviation of daily returns for rolling 60-month 
windows. Examples of two trading days are presented. Figures 2a and 2b show 
the rolling 60-month returns and standard deviations of the contrasting trading 
days -1 and -5. For the high-return day -1, there are consistently positive returns 
with relatively constant standard deviation (after the 1930s). For the low-return 
day -5, there are consistently negative retums and somewhat higher and more 
variable standard deviations. 
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Figure 2a 
Rolling 60-Month Returns and Standard Deviations for Trading Day -1 
(February 1928 to June 1993) 
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Figure 2b 
Rolling 60-Month Returns and Standard Deviations for Trading Day -5 
(February 1928 to June 1993) 
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2 The Return Patterns in the S&P 500 


The average return per day for the 65 years was 0.0186%. The mean returns 
during the turn of the month were over six times as high at 0.1236%, which has a 
t value of 5.94 for the hypothesis that the turn of the month days have returns 
above the mean. The probability of finding a daily mean return of 0.1236%, when 
the true daily mean return is 0.0186%, is less than one in a thousand. The first half 
returns were also significantly above average.’ However, during the rest of the 
month, the returns were significantly below average and also significantly 
negative. 


Table 1 
Average Daily Returns and t values for the S&P 500 
(February 1928 to June 1993) 


Average Daily 
Trading Days Return (%) t value 
All days 0.0186 0.00 
TOM 0.1236 5.94 * 
FH 0.0703 4.13 * 
ROM -0.0235 -3.71 * 


* Asterisks denote t values for returns that are significantly 
different from the average at the 5% level of significance, 
using a one-tail t test. 


There was also a significant seasonality in the S&P 500 returns. January, 
March, May, and July had mean returns during the turn of the month significantly 
above the overall average (see Table 2). In every month, the mean daily returns 
during TOM were higher than the average daily return. January and July also had 
significantly higher mean daily returns during the first half of the month than the 
average daily return. The mean daily returns in the first half were positive in all 
months except September. The rest of the month had negative mean daily returns 
in all months except January, June, August, and December. The rest of the month 
for September and October—ited in the media as a time for stock market 
losses—did have large negative mean daily returns, but only September was 
significantly below average. May and November also had significantly negative 
mean daily returns during ROM. 
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Table 2 
Average Daily Returns by Month, 
During the Turn, First Half, Rest of the Month, and Whole Month 
(February 1928 to June 1993) 


Average Daily Returns (%) 


TOM FH ROM All Days 
S&P 500 Index -1 to +4 -1 to +9 +10 to -2 -1 to -1 
January 0.2061* 0.1025* 0.0359 0.0651 
February 0.0807 0.0170 -0.0214 -0.0024 
March 0.1876* 0.0768 -0.0212 0.0208 
April 0.0503 0.0566 -0.0169 0.0161 
May 0.1653 * 0.0819 -0.0836* -0.0107 
June 0.1287 0.0669 0.0033 0.0315 
July 0.2258* 0.1697* -0.0050 0.0738* 
August 0.0645 0.0672 0.0129 0.0364 
September 0.0976 -0.0175 -0.0978* -0.0605* 
October 0.0445 0.0632 -0.0787 -0.0178 
November 0.1108 0.1038 -0.0821* 0.0071 
December 0.1217 0.0564 0.0599 0.0584 
All Months 0.1236* 0.0703* -0.0235* 0.0186 
*Asterisks denote returns that are significantly different from the average daily return for all days 


(0.0186%), at the 5% level of significance, using a one-tail t test. 


The evidence is that all of the gains in the S&P 500 from 1928 to 1993 were 
at the turn and during the first half of the month. The returns in the rest of the 
month were nonpositive. We investigated the frequency distribution of the largest 
percentage gains and losses for various times of the month. We found that the 
higher average returns during the FH in the 1928 to 1993 sample were partly 
composed of a higher frequency of very large returns and a lower frequency of 
very low returns. However, the results were not strong enough to have statistical 
significance. Thus, large gains and losses have at most a minor effect on the result 
that all the gains were in the FH during 1928 to 1993.* 


3 Investment Strategies 


The previous sections have shown that the TOM and FH have very strong daily 
returns, on average, and these returns are not the result of a few large gains and 
losses. In this section, we compare the return percentages, wealth relatives, and 
correlations of large stocks, small stocks, various types of bonds, cash, and two 
investment strategies. The strategies are to invest in the S&P during either the 
TOM or FH and in cash for the remainder of the month. This allows us to 
compare these investment strategies with other buy and hold investments such as 
S&P 500 index and small cap funds. No transaction costs were included. 
However, institutional investors could use futures to implement these strategies, 
dramatically reducing the impact of transaction costs.” Such an investor could use 
futures to adjust the net long S&P 500 to cash position during periods with low or 
negative average returns. The comparison of different idealized strategies 
provides an estimate of the potential value of investing based on the historical 
evidence of higher mean returns during TOM and FH. 
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3 Investment Strategies 


Table 3 
Returns and Growth for Various Investment and Seasonal Strategies 
(February 1928 to June 1993) 


Investment Monthly Avg. Monthly St. Yearly Avg. Yearly St. Growth of $1 
Strategy Return (%) Deviation Return(%) Deviation Investment 
Large Cap 0.79 5.80 9.50 20.11 439.13 
Small Cap 0.96 8.71 11.53 30.18 1,483.63 
TOM + .8 Cash 0.84 2.54 10.13 8.79 758.36 
FH + .6 Cash 0.92 3.64 11.06 12.62 1,290.97 


Table 3 compares Large Cap and Small Cap returns with the returns from two 
investment strategies. Ibbotson Associates supplied the total return data for the 
large capitalization (S&P 500) and small capitalization (bottom 20% of 
companies, capitalization weighted). The final two strategies (TOM + 0.8Cash 
and FH + 0.6Cash) invest in the S&P 500 during the turn of the month or the first 
half and then in cash for the remainder of the month. It is assumed that these 
strategies are in cash 80% and 60% of the month, respectively. These two 
strategies do not include dividends for the periods that they are invested in the 
stock market. Hence their returns are a conservative estimate of the true total 
returns. 

The strategies of being long in the S&P 500 during TOM or the FH and then 
in cash had consistently high mean returns that were mean-variance superior to 
the buy-and-hold Large Cap strategy. That is, the TOM and FH strategies had 
higher returns and lower standard deviations than an investment in the S&P 500 
for the entire period. 

The growth of $1 invested in the various investment strategies for the entire 
1928-1993 period is shown in the last column of Table 3. The results show the 
growth of one dollar over the 65 years of the sample. In interpreting these results, 
one must consider the number of trading days in each period; TOM had five days, 
FH had 10 days, and the whole month had about 20 to 22 days. The total growth 
in the small capitalization index, $1,483.63, was higher than the TOM-plus-cash 
and FH-plus-cash strategies that returned $758.36 and $1,290.97, respectively. 
The TOM and FH strategies dominated Large Cap stocks. Adjusting for risk via 
standard deviation, points to the superiority (high returns and relatively low 
standard deviations) of the TOM and FH strategies during the 65-year sample 
period. 

Next we calculated the correlations of these strategies with other investments 
to see if the correlations make these strategies more or less attractive to 
institutional investors. If the correlations are relatively low with their other 
investments, these strategies can provide additional diversification when added to 
their current holdings. Ibbotson Associates total return data was used for large and 
small stocks, high-yield corporate bonds, long-term (twenty years) and 
intermediate-term (five years) government bonds, and cash measured by the 90- 
day T-bill return. 

Table 4 displays the correlations between the various investment returns from 
February 1928 to June 1993. The TOM- and FH-plus-cash strategies have a 
relatively low correlation with large and small capitalization stocks—namely, 
0.46 and 0.38 for TOM and 0.67 and 0.57 for the FH, respectively.” These 
correlations are of a level that would provide attractive diversification benefits 
(reduced risk) when included in a traditional portfolio of stocks and bonds. Hence 
these strategies may be considered as separate asset classes in mean-variance or 
related asset allocation studies just as smal] or large capitalization or foreign 
stocks might be. 
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Table 4 
Correlations for Various Investment and Seasonal Strategies 
(February 1928 to June 1993) 


Small |Corporate|Long Govt} Intermed. 
Govt Bond} Cash 


Correlations +.8 Cash|+.6 Cash 


Large Cap 
Small Cap 
Corporate Bond 0.22 
Long Govt Bond 0.17 
Interm. Govt Bond 


Cash 0.00 
TOM+.8Cash | 0.46 1.00 
FH + .6 Cash 0.67 0.69 | 1.00 


4 Concluding Remarks 


There was a substantial turn-of-the-month effect in US large capitalization stock 
prices as measured by the S&P 500 during the 65-year period 1928-1993. The 
results show that the mean returns in the stock market were significantly positive 
in the turn and first half of the month and significantly negative in the rest of the 
month. 

The cumulative wealth effects of investment during various time periods 
magnify the effects. The results indicate that the total return from the S&P 500 
over this 65-year period was mostly received during the turn of the month. The 
strategy of being long the S&P 500 during the TOM or the FH and T-bills 
otherwise had very high total returns (exceeded only by small stocks); and when 
risk is considered, dominated all the strategies considered, including the small 
stocks. 

The results point to an advantage from investments in TOM and the FH, as 
opposed to ROM, on average. However, there may be different results in a 
particular year depending upon economic fundamentals and economic news 
shocks. 
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6 Endnotes 


" See Merrill (1966), Fosback (1976), and Hirsch (1986). 


* Recent literature surveys of trading anomalies appear in Hawawini and Keim 
(1995) and Ziemba (1994). 


° Ogden (1987, 1990) discusses empirical support for the flow of funds into the 
stock market hypothesis. Cadsby and Ratner (1991) provide further support for 
the turn of the month cash flow hypothesis using data from Canada, the United 
Kingdom, Australia, Switzerland, and West Germany. Cadsby and Ratner found a 
significant turn-of-the-month effect on trading days -1 to +4 in these countries. 
Ziemba (1991) studied the turn-of-the-month effect and the cash flow hypothesis 
in Japan. 


* The following explanations for the turn of the month have also been advanced: 
behavioral (Penman 1987); inventory adjustments of different traders (Rock 1989 
and Ritter 1988); the timing of trades by informed and uninformed traders 
(Admati and Pfleiderer 1988); specialists’ strategies in response to informed 
traders (Admati and Pfleiderer 1989); seasonal tax-induced trading (Lakonishok 
and Smidt 1986); and window dressing induced by periodic evaluation of 
portfolio managers (Haugen and Lakonishok 1988 and Ritter and Chopra 1989), 


* Ariel (1987) has documented the TOM effect for small- and large-capitalization 
stocks for the 19 years from 1963-1981. His data consisted of the equal- and 
value-weighted indexes of all NYSE stocks from the Center for Research on 
Security Prices (CRSP) tape. The first half of the month had all the gains. ROM 
had negative returns. Hence, investment in the first half of the month provided 
more than all the year’s stock market gains. Some additional support for Ariel’s 
findings for the period from 1982-1988 appears in Cinar and Vu (1991), Keim 
and Smirlock (1987), and Linn and Lockwood (1988). Lakonishok and Smidt 
(1988) investigated daily returns in the Dow Jones Industrial Average from 1897 
to 1986. Using this price weighted average of 30 stocks, they also found high 
returns at the turn of the month. 


ê The shifting of higher mean return to days -2 to +3 from -1 to +4 may be related 
to futures anticipation of this effect. Hensel, Sick and Ziemba (1994) using data 
from 1982-1992 found that the TOM effect is anticipated to some extent in the 
futures on days -4 to -2. 


’ We also examined the TOM, FH and ROM returns by decades from the 1930s 
through the 1980s. Although there was some variability across decades, there 
were consistently high returns during TOM and the FH in all the decades. 


* This result is consistent with the hypothesis that large gains and losses are the 
result of financial news information shocks that occur randomly in time; and they 
are distributed in TOM, the FH and ROM in proportion to the relative number of 
days in these trading periods. 


° Anticipation by the futures market of these stock market gains would need to be 
considered using results such as those in Hensel, Sick and Ziemba (1994). 


"° These correlations can also be estimated theoretically. Suppose the value of the 
small capitalization portfolio and S&P 500 portfolio follow a joint geometric 
random walk with a constant drift, correlation, and variances throughout the 
month. Also suppose that investing in cash yields no return variance. That is, 
interest rates are constant. Let f denote the fraction of days of the month for 
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which the trading is invested in stock. For the TOM-plus-cash strategy, f is 
approximately 0.2, and for the FH-plus-cash strategy, f is approximately 0.4. Let 
the correlation between the returns of the fully invested S&P 500 portfolio and 
either the small capitalization or large capitalization portfolio be p. Then the 
correlation between the returns of the TOM- and FH-plus-cash trading strategies 
with the returns of S&P portfolio or the small capitalization portfolio equals vf p. 
The correlations of the fully invested portfolio with the S&P 500 portfolio is 

p = 1, so we would expect the TOM-plus-cash and FH-plus-cash strategies to 
have correlations of approximately J0.2 = 0.45 and /0.4 = 0.63 with the large- 
capitalization stocks. This is consistent with the data, suggesting that the variance 
of the S&P 500 portfolio is the same during the TOM or FH as it is throughout the 
rest of the month. Similarly, by Table 4, the correlation between the S&P 500 and 
the small capitalization portfolio is p = 0.86 over the whole month. This implies 
correlations between the small capitalization portfolio and the TOM-plus-cash and 
FH-plus-cash strategies of 0.39 and 0.54, respectively. These are consistent with 
the empirical evidence. However, the theoretical correlations are slightly less than 
the empirical correlations for the TOM-plus-cash strategies, which suggests that 
the variance rate of the S&P 500 portfolio increases slightly during the turn of the 
month. 
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Summary of 

"Playing the Turn of the Year Effect with Index Futures” 
by Ross Clark and William T. Ziemba, 

November-December issue of Operations Research 


For twenty years since Gene Fama’s doctoral dissertation at the 
University of Chicago in the early 1960's, the academic financial community 
considered the case for the efficient market hypothesis air tight. Current 
prices of stock, bonds, treasury bills, and other investments were the right 
prices; they fully reflected all the publicly available information concerning 
their value; the best estimate of their future value was the current price. 
You would do just as well throwing darts at the stock pages as hiring a 
professional portfolio manager. Indeed when three executives of Forbes 
Magazine threw 28 such darts in the summer of 1967 they expected their 
portfolio to more or less match the market indices. By the summer of 1984 
their $28,000 had grown to $132,000, excluding dividends, a 370% gain, which 
was more than ten times the gain of the Dow Jones industrials. Why did they 
do so well? After all, according to the prevailing wisdom supplied by the 
capital asset pricing model, one can only achieve higher mean returns by 
bearing more risk. 

Risk comes in two forms: 1) market risk, measured by movement of the 
portfolio as compared to the market ~ the infamous beta (the covariance of the 
portfolio with the market divided by the variance of the market) =- one cannot 
do anything about this risk; and 2) diversification risk, which for 
independent securities drops off with the reciprical of the number of 
securities. With 28 fairly independent securities the diversification risk is 
small (about 8% of its maximum). The portfolio beta was about par equal to 


the market beta of one. 
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The portfolio did so well because it was the small firm effect in 
disguise. Small capitalization stocks, what you emphasize when you put equal 
dollar amounts on a relatively large number of random securities, simply 
return more on average than large capitalization stocks such as those in the 
Dow. In 1981 Rolf Banz and Marc Reinganum, both Chicago Ph.D. students, 
showed that even when you risk adjust, the small stocks return much more than 
the big stocks. Thus the field of stock market anomalies or, as we now call 
them, empirical regularities gained some respectability. 

Dozens of studies have since documented numerous manifestations of the 
data that seem to violate the efficient market hypothesis: low price 
earnings, Value Line #1's, closed end mutual funds selling at large discounts, 
S&P 500 additions, stocks that receive no dividends and the like seem to have 
excess returns. Purists are still trying with little success to find risk 
measures that bring these violations to the theory back into line. It is a 
rough task - how does one explain that over the past ninety years fully 51% of 
all the gains in the Dow Jones industrials have occured on the 8 to 10 trading 
days each year that preceed the major U.S. holidays? And that the probability 
that the Dow Jones index will fall on a Monday following a negative Friday is 
nearly 75%? 

Perhaps the most pervasive of the anomalies is the small firm effect. 
From 1926-1985 a $1 invested in T-bills grew to $7.47, long term government 
bonds to $11.03, long term corporate bonds to $16.55, common stocks to $279.12 
and small stocks (those New York Stock Exchange securities in the bottom fifth 
in capitalization) to $1,241.24. The small stocks returned nearly five times 
the average stocks and more than ten times the largest stocks. A mutual fund 
that invests precisely in this way, essentially buying and holding the NYSE 
bottom fifth, of which Professor Fama is a major architect, manages about $5 


billion in pension money. 
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In 1983, Donald Keim, another Chicago Ph.D. student, discovered that at 
least half of small stock gains occured in January. Richard Roll at UCLA and 
others then found that most of these January gains occurred on five trading 
days (-1, the last day in December, and +1,...,+4, the first four days in 
January). On these days the small stocks simply leap enormous amounts 
relative to the big stocks, and have consistently done so nearly every year 
for decades. In 1986 Jay Ritter found that during the fourteen years, 
1971-1984, the difference of small stocks versus big stocks over the nine 
trading days (-l to +8) ranged from 3.0 to 41.5% and averaged 9.9%. The risk 
premiums are higher on these days for the small stocks but that alone does not 
explain these extremely high returns. Indeed, recent studies have shown that 
risk as measured by the capital asset pricing model is rewarded by increased 
returns only in January. 

The causes of the turn of the year effect seem to be: (a) tax loss 
selling, (b) renewed buying interest in small stocks in the new year because 
of the availability of excess cash balances that, because of the low trading 
volumes, causes upward price pressure, (c) high transactions costs that 
prevent these patterns from being arbitraged away, (d) portfolio manipulations 
and (e) turn of the month and quarter price rise effects. 

I have been interested in the study of speculative markets, be it 
blackjack, lotto games, horseracing or financial securities. In each case the 
procedure is to (1) identify an edge and (2) to determine a proper investment 
strategy. 

There is strong evidence that small stocks outperform large stocks at the 
turn of the year. Yet the transactions costs eat away most, if not all, of 
the potential gains. The transactions costs on index futures is a tenth or 


less of the corresponding basket of securities. Hence, a strategy that should 
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be very profitable is to hold long positions in a small stock index and short 


positions in large stock indices. 


Stock index futures began trading in the U.S. in 1982 and the number of 
different contracts available and their volume has been increasing steadily. 
The ideal way to play the turn of the year effect is to be long the smallest 
stocks and short the largest stocks in liquid index contracts. Our experience 
is with the spread between the Value Line and S&P indices called the VL/S&P 
spread. This is not an ideal way to play the effect, but it is the best we 
have found so far and it has been successful. The Value Line index is an 
equally weighted geometric weighted average of the prices of nearly 1,700 
securities with futures traded on the Kansas City Board of Trade and futures 
options traded on the Philadelphia Stock Exchange. IBM and the smallest 
company in the index are treated equally in the weighting. The Value Line 
index has a downward drift of about 5.5% per year relative to the component 
securities in the index because of the geometric averaging, due to the 
geometric-arithmetic averaging inequality. The S&P 500 futures contract 
is traded on the Chicago Mercantile Exchange. It is value weighted. Hence, 
IBM and the other large stocks count much more than the medium size stocks at 
the bottom of the index. Hence, in a crude fashion, the VL/S&P spread gives 
you the small stocks long and the big stocks short. The bigger the stocks 
are, the more short they are. However, all the small stocks and medium stocks 
are held in the same proportion. 

When should you trade? We have found that the rule: buy the spread on 
the first closing uptick, starting on December 15 and definitely by the 17th, 
and sell on January 15, has worked well. Waiting until (-1) now seems to be 
too late: possibly sufficient finance professors and their colleagues, and 


other students of the turn of the year/January effect move the VL index. 
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There seems to be a bidding up of the March VL future price relative to the 
Spot price. By January 15th, the biggest gains are over and the risks 
increase. During the ten years 1977-1986, the spread dropped 0.92 points, on 
average, with a high variance. The projected gain from a successful trade is 
0-5 points ($500 per point), and averages 2.85 points or about $1350 per 
spread. On average, the December 15 to (-1) day gain on the spread, is 0.57 
points. However, it was 1.05 in 1985 and 3.15 in 1986 which may reflect the 
fact that with the thin trading in the VL index. The market can be moved with 
a reasonably small number of players, who are learning about the success of 
this trade, i.e. the basis was bid up anticipating the January move. Clark 
and I made this trade successfully in 1984/85, 1985/86 and 1986/87. For our 
1986/87 play we attempted to optimize our investment using modifications of 
the capital growth or Kelly criterion which seems to have the most desirable 
properties for repeated investments over time. The procedure allows one to 
balance expected gain with probability of success and leads to a prescription 
of the optimal number of contracts to hold consistent with one's risk-reward 
preferences. 

We employed the concepts in a $100,000 speculative account for a client 
of CARI Ltd., a Canadian investment management company. We decided to 
purchase five VL/S&P spreads to approximate a slightly less than 25% 
fractional Kelly strategy. Watching the market carefully, we bought these on 
December 17, 1986 at a spread of -22.18 which was very close to the minimum 
that the spread traded. On the 15th the spread closed at -20.90 and on the 
16th at -22. The spread increased in value to -18.15 at the end of the year 
in a flat and declining stock market for a gain of 3.85. In January the stock 
market took off in an impressive style with ten consecutive up days. The 


Spread continued to gain and we cashed out at -16.47 on the 14th for a total 
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gain of 5.55 points per contract or a total gain of $14,278.50 after 
transactions costs. The spread began to drop sharply on the 15th and the drop 
escalated into a rout of the small stocks in comparison with the big stocks in 
the S&P 500. At the end of January the index stood at -31.45. The trade must 
be handled carefully. More or less the December 15th to January 15th period 
seems best. More and more players move the market in the December 15-31 
period and, as in 1985/86, most of the gains occured then. 

Will the stock market crash of October 19, 1987 or the tax changes affect 


the turn of the year effect? We think not but only time will tell. 


William T. Ziemba 


Alumni Professor of 
Management Science 


University of British Columbia 


Dr. Ziemba is currently working on a book tentatively titled "Strategies for 
Making and Keeping Excess Profits in the Stock Market" and he is the co-author 


of "Dr. Z's Beat the Racetrack," Morrow, 1987. 
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‘Political’ investments turn out to be lucrative 


By Greg Heberlein 
Seattle Times Business Reporter 

Investing may be simpler than 
it seems, a research study from 
the world’s most prominent asset 
consultant says: 

Buy stocks of smaller 
companies when Democrats 
control the White House, buy 
bonds when Republicans are in 
command. 

The research report was 
issued today by the Frank Russell 
Co. of Tacoma. The company is 
the premier consultant to pension 
and investment groups. Much of 
Russell’s work centers on 
evaluating adviser performance. 

The research comes with no 
guarantee future results will 
match past returns. But it notes 
that because all stocks tend to do 
better in the second half of a 
president’s term, and because 


President Clinton is a Democrat, 
small-stock performance could be 
vigorous. 

“If past trends hold true, 
small-stock returns could 
increase in the second half of 
Clinton’s term,” said Chris 
Hensel, Frank Russell research 
analyst. “This research 
demonstrates that investing in 
specific categories of stocks and 
bonds according to the party in 
the White House has meant 
dramatically higher returns.” 

How much higher? Much, 
much higher. 

According to a study that 
Hensel co-authored with William 
Ziemba of the University of 
British Columbia, small-company 
stocks performed 10 times better 
under Democrats: 20.5 percent a 
year vs. 1.9 percent a year under 
Republicans in the 1929-1992 


period — 63 years. Larger stocks 
had statistically identical returns. 

But bonds boomed under 
the GOP. The Russell study 
showed that long-term 
corporate bonds, long-term 
government bonds, inter- 
mediate government bonds and 
cash outdistanced stocks on an 
annual basis by 4.9, 5.5, 4.6, 
and 2.9 percent, respectively. 

Already during the Clinton 
term, small-company stocks 
including dividends have risen 29 
percent vs. 11.7 percent for larger 
companies. 

Hensel and Ziemba’s study is 
called U.S. Investment Returns 
During Democratic and 
Republican Administrations, 
1928-1993. 

The Russell 
calculate two 

PLEASE SEE Stock Study ON E5 


company 
in- 


Mixing politics with investment decisions 


proves to be lucrative 


Stock Study 
CONTINUED FROM E 1 
vestment strategies that would 
have maximized returns off this 

theory. 

From January 1942 to 
December 1993, $1,000 invested 
in small-company stocks during 
Democratic terms and in large 
stocks during Republican terms 


Source: Seattle Times, 1994 


would be worth $2.7 million 
today vs. $1.1 million if invested 
only in small-company stocks 
and $289,000 if invested only in 
large-company stocks. 

During the same time period, 
$1,000 invested in 
small-company stocks during 
Democratic administrations and 
intermediate-term government 


bonds during 
administrations would have 
returned $1.2 million vs. 
$103,000 from the traditional 
60/40 mix of larger stocks and 
intermediate bonds. 

Russell defines the largest 
1,000 stocks as large-company 
stocks and the next 2,000 stocks 
as small-company stocks. 


Republican 
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This paper discusses the Nikkei put warrant market in Toronto and New York during 1989-1990. Three 
classes of long term American puts were traded which when evaluated in yen are ordinary, product and 
exchange asset puts, respectively. Type I do not involve exchange rates for yen investors. Type II, called 
quantos, fix in advance the exchange rate to be used on expiry in the home currency. Type IH evaluate the 
strike and spot prices of the Nikkei Stock Average in the home currency rather than in yen. For typically 
observed parameters, type I are theoretically more valuable than type II which in turn are more valuable than 
type III. In late 1989 and early 1990 there were significant departures from fair values in various markets. This 
was a market with a set of complex financial instruments that even sophisticated investors needed time to learn 
about to price properly. Investors in Canada were willing to buy puts at far more than fair value based on 
historical volatility. In addition, US investors overpriced type II puts fixed in dollars rather than the type 
Ps in yen. This led to cross border and US traded (on the same exchange) low risk hedges. The market’s 
convergence to efficiency (that is, all puts priced within transaction cost bands) took about one month after 
the introduction of the US puts in early 1990 leading to significant profits for the hedgers. 


Keywords: option mispricing, cross-border trading, Nikkei stock exchange 


1. Introduction 


The Japanese stock market rivals that of the US in size and importance. Its growth in trading 
volume and capitalization has been large. The markets in Tokyo, Osaka, Nagoya and five other 
regional exchanges that now trade were closed during World War II and then reopened in May 
1949. Post war construction and aid by the US helped the Japanese economy grow quickly. By 
1960, 9 % of the world’s equity capitalization was Japanese compared to 58 % in the US, 27% in 
Europe and 6% in the rest of the world. By 1980 Japan had increased its share to 15 % mainly at 
the expense of Europe whose share fell to 20 %. The rest of the world doubled its capitalization to 
12 % and the US still had the majority with 53 %. The 1980s were a period of economic excesses in 
the US that led to a weakening of the strong economic base held in 1980. The Reagan economic 
policies led to large deficits in both the overall budget and in trade, as well as large increases in 
military spending and debt payments (see Modigliani (1988) and Hatsopoulos, Krugman and 
Poterba (1989) for analyses). Meanwhile Japan maintained a policy of high investment in plant 


This paper was written in 1991 for the presentation of Ziemba (1991a). Relevant references and data occurring in the 
meantime have been updated to 1995. 
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and equipment and R&D, financed through policies that emphasized and rewarded high savings. 
Company expansion proceeded more through debt and retained earnings than equity. Debt was 
readily available and was at low nominal and real interest rates particularly through banks in the 
same industrial grouping (keiretsu). 

The 1980s were very financially favourable for Japanese firms. The combination ot low interest 
rates, easy access to funds, strong export marketing expertise, emphasis on quality, access to 
foreign markets while maintaining structures and rules making imports to Japan difficult, led to 
an enormous relative wealth transfer from the US to Japan. By December 1988, Japan’s equity 
capitalization was 44%, Europe’s 21 %, the rest of the world’s 6 %, and the US fell to 29 %. After 
the 1990-92 stock market decline, Japan’s share fell to 25% as of September 1992. The US was 
then 40%, Europe 25% and the rest of the world 6%. 

French and Poterba (1991) and Ziemba and Schwartz (1991) have argued that to properly 
measure capitalization one must adjust for company cross holdings. If company A holds much 
of company B’s stock and vice versa, then the true market capitalization of A plus B is less 
than the sum of the individual capitalizations. Japan and many of the European cconomies such 
as Germany and Italy have extensive cross holdings. In 1990 about 71 % of Japan’s equity was 
cross-held and rarely, if ever, traded. Calculations show that the capitalization of Japan in the late 
1980s was overstated by about 25 % (see for example, McDonald, 1989 and Ziemba and Schwartz, 
1991). After such adjustment, the shares in 1988--89 were about 39 % for Japan, 33 % for the US, 
22% for Europe and 6% for the rest of the world. 

Historically, Japancse stock markcts have been much more influenced by foreign markets than 
the reverse as shown by Becker, Finnerty and Gupta (1990), Hamao, Masulis and Ng (1990) 
and Ziemba and Schwartz (1991). The transmission of mean returns and volatility was mostly 
unidirectional until the October 1987 world wide stock market crash. Since the crash, stock 
price movements in Japan have had more impact on those in New York and London. However, 
the reverse effect is much stronger (see for example, Hamao, Masulis and Ng (1991)). It may be 
argued that this integration of capital markets was small because the Japanese markets were 
insulated to a large extent from foreign influence and not deregulated. 

Deregulation of Japan’s financial markets began in earnest in 1987 with the introduction of the 
first equity index futures contract, the Kabusaki 50, which traded in Osaka. Futures on the Nikkei 
Stock Average had begun trading on the Singapore Monetary Exchange (SIMEX) in 1986. Bailey 
(1989) discusses the early history of these two futures contracts. In September 1988, trading began 
in futures contracts on the more popular Nikkei 225 stock average and the Tokyo stock price 
(Topix) index which were traded in Osaka and Tokyo, respectively. These contracts allowed 
foreign investors and institutions to easily hedge positions in Japanese equities and to engage 
more fully in a variety of types of programmed trading including index arbitrage and portfolio 
insurance. During 1988 and 1989, the Japanese equity markets increased dramatically in trading 
volume and market capitalization. This was a period of cheap and easily available money for 
corporate and individual investors and speculators. 

The equity warrant bonds issued in Luxembourg with the warrants trading in London was one 
such example. By adding the warrants which were stripped off and traded separately the bonds 
could offer low coupons. Hedging the proceeds of the bond sales, which were mainly in dollars, 
back into yen with its considerably lower interest rates, provided net costs at borrowing at close to 
zero percent. The equity warrants, when exercised several years later if they were in-the-money, 
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provided an additional source of funds for the firm to redeem the bonds for a slight dilution. This 
market was in excess of $200 billion. See Mikami (1990), Takahashi (1990) and Kuwahara and 
Marsh (1992, 1994) for analyses of the pricing of these warrants. A large percentage of these 
warrants expired worthless because of the 1990-95 decline in stock prices. 

Over-the-counter long-dated typically three year puts were marketed in 1988 by major non- 
Japanese brokerage houses to corporate clients who wished to hedge against long Japanese equity 
exposure or to speculate that the high priced Japanese stocks would eventually decline sharply. The 
sellers of these puts, which typically had premium value of $100000 plus and were priced to trade 
at volatilities around 16—20 % versus the historical 13 %, were mostly large Japanese corporations. 
The corporations displayed a collective arrogance about the strength of the Japanese stock market 
and economy by generally not hedging. The high price earnings ratios in the 70 plus range and the 
astronomically high land prices typified by facts such as the Imperial Palace in Tokyo being 
“worth” as much as all the land in California led professional and amateur investors and econo- 
mists to believe that these high prices could not be sustained (see Aron (1981, 1989), French and 
Poterba (1991), Friedberg (1990), Ueda (1990), Wood (1990) and Stone and Ziemba (1993) for a 
variety of analyses with this conclusion). The first Japanese put warrants available to individual 
investors on an easily purchasable basis were the three year American-type Nikkei put warrants 
that traded on the Toronto Stock Exchange in February 1989. These puts were not true warrants 
as they were cash settled based on the price of the Nikkei Stock Average and were not exercisable 
into stock. Also their issuers were investment banks not individual firms. These warrants provided 
individual investors with the opportunity to bet against the high stock prices in Japan with a 
minimal investment of capital. The “warrants” were thus long dated put options. US investors 
were not allowed to purchase those warrants for three months and these warrants were not 
widely advertised and known outside Canada. 

These warrants were purchased in such demand that their price in implied volatility was well 
above the historical for the NSA index. Most Canadian investors probably had no idea what the 
fair value was. However, despite paying prices up to four times the Black-Scholes (1973) fair value, 
investors who held these put warrants to expiry made very large profits because of the large decline 
in the Nikkei index. The Toronto warrants were of three types: ordinary puts valued in yen, puts 
where the final exchange rate for yen is fixed in advance and puts where the NSA was evaluated in 
Canadian dollars. 

This latter type allowed investors to profit from declines in the NSA or the Japanese yen or 
both. Although in principle straightforward to professionals as discussed in Sections 3 and 4, 
below, besides being unable to evaluate the fair values of these warrants investors were unable 
to evaluate the relative differences between the various types of warrants. Thus, with complex 
instruments, even the most sophisticated in the market needed time to understand the products, 
price them fairly and invest in them to eliminate mispricings.! The warrants also had different 
credit characteristics and when exercised were evaluated on the next day’s closing price of the 
NSA in Japan if there was trading that day otherwise on the next trading day. Investors exercising 
warrants could also put in the proviso that the warrant not be exercised if the NSA rose 500 or 


' Indeed Reiner’s (1992) article in Risk is summarized as follows: “Investors and dealers are increasingly turning to equity- 
linked forex options — then finding they don‘t know how to valuc them. Eric Reiner explains how to adapt Black-Scholes 
and its variants”. 
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more points on the requested exercise day. In late 1989 the type I and type III Canadian NSA put 
warrants were greatly over-priced in comparison to fair values based on historical volatilities 
including that for the 1987 world wide stock market crash period. There was no way for a small 
investor to hedge these instruments. Large investors or institutions could, or course, hedge in the 
futures markets on the SIMEX or in Japan. Indeed this was the way that the issuers who sold the 
puts and were responsible for their exercise payments hedged their investment. 

Grossman (1988) among others has argued that this is not a fully suitable approach because 
the futures synthetic does not have the same information requirements as the underlying derivative. 
Hence the same types of difficulties associated with the breakdown of portfolio insurance in the 
1987 crash could possibly occur in this market as well. See Rubinstein (1988) for an analysis of 
the effect of portfolio insurance on the crash. A better hedge for investors was thus a negotiated 
over-the-counter put on the NSA that essentially matched the Toronto stock exchange traded 
warrants. 

Such instruments were available in late 1989 from investment firms such as the Salomon 
Brothers and Bankers Trust. The authors and others were aware of the potential of shorting expen- 
sive Canadian NSA puts and hedging with a fair priced puts of similar characteristics and duration 
in another market. To short the Canadian put warrants these warrants needed to be borrowed 
since there was a fixed number of them issued. They also had to be shorted according to the 
uptick rule. This was more difficult than shorting an ordinary exchange traded put or call or an 
index option which is essentially from an infinite supply and does not have these restrictions. 
However, it was possible to short Canadian NSA put warrants in large numbers at the high 
implied volatilities. It was expected that the market price of these puts would drop to their fair 
value once a fairly priced product was easily available. Bankers Trust, the Salomon Brothers and 
the Kingdom of Denmark issued such warrants in January and February 1990 which traded on the 
American Stock Exchange. These warrants were all fixed exchange rate securities (of type I, called 
quantos) except for the Bankers Trust January put which was a type I with a floating exchange 
rate. 

Bernard and Thomas (1989, 1990), AMeck-Graves and Mendenhall (1992) and Aberbanell and 
Bernard (1992) have shown that frequently there are considerable delays in the market price adjust- 
ment lo new earnings announcements. Indeed some of these adjustments take several months to be 
fully reflected in market prices. Jacobs and Levy (1988) found that lagged earnings surprises are a 
declining but significant factor in US security prices for one, two and three months after their 
announcement. The convergence of the NSA puts to efficiency were similar and the process 
took over one month from the time the first NSA put warrant was traded on the American 
Stock Exchange in January 1990. Large profits were made by hedgers, including the authors, 
although they took several risks that are difficult to quantify. Besides the credit and exchange 
rate risks (which could be hedged) there was a risk of forced buy-ins of the shorts at unfavourable 
prices because it was no longer possible to borrow the puts. All three of the authors had forced 
buy-ins for a small amount of their position. There was also a profitable hedge between fixed and 
non-fixed exchange rate puts that was affected by the absolute price of the puts. 

Developing models to account for these imperfections is difficult; see Figlewski (1989) for some 
results obtained by simulation. In all cases for the Nikkei puts and the Nikkei calls which are 
discussed in Sections 4 to 6, fixed exchange rate options, the quantos, traded for prices above 
non-fixed exchange rate options when the theoretical price was less. Investors were willing to 
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pay a premium to eliminate this exchange rate risk even though it could have been hedged much 
cheaper in the foreign exchange futures markets. Also investors paid a premium for low nominal 
priced warrants that is analogous to that of low priced stocks, see for example, Blume and Stam- 
baugh (1983). This led to a hedge that was close to arbitrage where investors could buy the high 
nominal value but low implied volatility Bankers Trust warrants and sell the lower priced 
Kingdom of Denmark and Salomon type A and type B warrants that had higher implied volati- 
lities on the same exchange (the American Stock Exchange). These warrants were mispriced for the 
month of February 1990. Except for slightly different credit risk and strike prices these warrants 
were virtually identical. One of the authors used this risk arbitrage to win the US stock market 
championship organized in Barron’s in the category of risk adjusted returns for accounts over one 
million dollars in 1990. 

The paper is organized as follows. Section 2 contains a brief background to the Japanese stock 
market bracketing the time of this study (mid 1989 to mid 1990). Historical volatility is also 
discussed there. Additional references on the Japanese stock market include Elton and Gruber 
(1989), Amihud and Mendelson (1991), Chan, Hamao and Lakonishok (1991), Ziemba and 
Schwartz (1991), Ziemba, Bailey and Hamao (1991), Ziemba (1989ab, 199lab), and Stone and 
Ziemba (1993). 

Section 3 discusses the various NSA put warrants and call warrants that were trading in 1989- 
90 on the American and Toronto stock exchanges and categorizes them into the three types which, 
using the definitions in Rubinstein (1991), are ordinary, product and option to exchange. See also 
Donnelly (1990), Smith and Dunn (1990), and Tufano (1992) for general discussions of these 
warrants. 

Section 4 discusses the fair numerical valuation or the three types of puts using the Cox, Ross 
and Rubinstein (1979), Boyle (1988) and Boyle, Evnine and Gibbs (1989) two and three dimen- 
sional binomial lattice models. Other authors such as Derman, Karsinski and Wecker (1990), 
Babbel and Eisenberg (1991), Rubinstein (1991), Gruca and Ritchken (1991), Clyman (1991, 
1992), Chen, Sears and Shabrokhi (1992), Reiner (1992), Wei (1992), Kat and Roozen (1994), 
and Dravid, Richardson and Sun (1993), have also discussed the pricing of the US Nikkei puts, 
particularly the type II fixed exchange rate quantos. 

Section 5 provides a theoretical basis for comparing the three types of puts. Their values 
depend upon the NSA volatility as well as possibly the exchange rate volatility between the yen 
and the home currency (US or Canadian) and their interactions. Using typical parameter values it 
is shown that type I puts should be priced higher than type II and in turn more than type III. This 
is in contrast to the actual pricing where the type II fixed exchange rate options traded for more 
than the ordinary type I puts during the study period. However, even though they were both over- 
priced in relation to historical volatility the type I and type II Canadian put warrants were 
correctly relatively priced by the market. 

Section 6 discusses the put warrant risk arbitrage and the convergence to efficiency of the two 
mispriced markets: the Canadian versus the US and the US fixed versus non-fixed exchange rate 
puts. Relative option costs for fixed NSA volatility and exchange rate volatility as well as implied 
volatility comparisons are made. The preference for fixed exchange rate options which applied to 
puts also applied to calls which began trading in April 1990. The mispriced securities we discuss are 
referred to as hedge candidates although in most cases they are close to arbitrage. Classical index 
arbitrage is actively pursued in Japan especially by the foreign firms, see for example Miller (1993). 
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Discussions of the potential profitability of such arbitrage appear in Brenner, Subrahmanyam and 
Uno (1989, 1990), Brooks and Yamada (1990), Lim (1992), Chung, Kang and Rhee (1992). 

Section 7 briefly discusses the relationship between the NSA put warrant prices in North 
America and the next day’s cash market in Japan. With deep in-the-money options, the put 
discount or premium signalled the up or down direction of the NSA on the next day in Tokyo. 
For a small data set the conclusion is that the signal was correct 68% of the time. This is consistent 
with the conclusion that futures hedging of these instruments had a strong effect on the Japanese 
stock market. Gruca and Ritchken (1991) also noted similar behaviour on the opening prices 
in Japan. A discussion of implications of the findings and concluding remarks appears in 
Section 8. 


2. The Nikkei stock average 1949—1995 and its historical volatility 


The NSA is a price weighted average of 225 large capitalized stocks traded on the Tokyo Stock 
Exchange. It is defined as 


where D, = divisor at time ¢. The original divisor was Djg49 = 225 and Dj992, Dec = 9-967. Figure 1 
shows the NSA from July 1983 to June 1995. The NSA was 109.9 when it began trading in May 
1949, It peaked at 38916 at the end of December 1989. There were twenty declines” of ten percent 
or more during 1949 to 1989. The index rose 220.84 times in yen and 553.04 in dollars from 1949 to 
1989. There were nine declines of ten percent or more during 1990-92. The index fell to 16925 at 
the end of 1992 a decline of 56.5% from the December 1989 high. Investors from 1949 still had 
96.21 for each yen invested and 277.53 for each dollar invested. The 1990-92 decline had its 
minimum at 14309, a decline of 63.2% from the December 1989 peak, on 17 August 1992. 
There is a very active index arbitrage market in the NSA which has been studied by Brenner, 
Subrahmanyam and Uno (1989, 1990), Chung, Kang and Rhee (1992), Miller (1993), and 
others. The value of the futures volume on the NSA trading in Singapore, Osaka and Chicago 
is the highest of any equity index in the world. 

The press called the stock market decline during 1990—92 the bursting of a ‘speculative 
bubble’. See also Tachi (1993) for a similar conclusion by the Ministry of Finance. Ueda (1990) 
and French and Poterba (1991) among others pointed to the very high Japanese stock prices with 
price earnings ratios of seventy or higher in 1988-89. Stone and Ziemba (1993) have analysed the 
steep rise in the stock and land markets during the 1980s in the era of cheap and readily available 
money and the subsequent steep decline largely caused by the Bank of Japan’s tight money policy 
of raising interest rates and decreasing the supply of money. They concluded that the decline in the 
stock and ‘essential use’ land markets can be explained as an adjustment to changing fundamen- 
tals. Speculative land such as the membership prices of golf courses and condominiums, on the 


> A decline is defined as the peak to valley when the fall exceeds ten percent and any subsequent rise would invalidate the ten 
percent fall. 
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Fig. 1. The NSA weekly close July 1983—June 1995. 


other hand, appears more likely to have been a speculative bubble. Table 4 (see forward) points to 
the high stock prices relative to past levels at the end of 1989. 

This paper studies the period mid—1989 to mid—1990. During 1989 the historical volatility was 
in the 10% range or slightly below its 1949-1989 average of 13%. Volatility has not been 
constant. Figure 2 shows the monthly averages of daily volatility from May 1949 to April 1989. 
While volatility peaked at 73.5% in October 1987 most of the time the annualized standard 
deviation was less than 20%. Volatility tends to rise in declining markets (Schwert, 1989; 
Turner and Weigel, 1992). The 1990-91 period in Japan had historical and implied volatilities 
in the 30—60 % range for much of this period; see Fig. 3. 

The annualized volatility was computed using 


250 22 E 
a= (BE -77) 


NSA, 
r, = 1001n | 


where 


for the last 20 trading days of the month, and NSA, is the closing value of the NSA on day t 
assuming there are 250 trading days per year. 


188 Calendar Anomalies and Arbitrage 


250 Shaw et al. 


73.6 (Oct 1987) 


60 | 56 6 (Dec. 1949) 


50.0 (Aug. 1971) 


40 


20 


| 
hi Wahl RAMEE Jd ' 


1949.5 53.5 57.5 61.5 65.5 69.5 73.5 77.5 81.5 85.5 89.4 


Fig. 2. NSA historical volatility, monthly averages of daily data annualized in percent, May 1949-April 
1989, Source: Jun Uno, Nihon Keizai Shinbun, Inc. 


3. NSA put warrants on the Toronto and American Stock Exchanges 
1989-1990 


The three year American style NSA put warrants are of three basic types. Let NSAọ be the strike 
price and NSA, the expiry price of the Nikkei stock average in yen. Let Ep be the current exchange 
rate and ŒE, be the exchange rate on expiry for Canadian or US dollars into yen. The symbol (X)~ 
means the greater of X or zero. Then in yen we have using Rubinstein’s (1991) classification of 
exotic options, where a, b, and c are constants. 


Value on expiry Type of put Currency risk in U.S./Canadian 
dollars? 
I. a[NSAy — NSA)" ordinary yes 
E, 
I. ba [NSAy — NSA,|* product no 
0 
NSAy NSA,\* : a Bae ds 
Ill. c E; ( E Dae E e) option to exchange yes, in index value and in this difference 
0 -e 


with the strike price converted to the 
home currency 


It is convenient to value the puts in yen as discussed in the next section. In their home currency 
(U.S. or Canadian dollars) using the symbols defined in Table 1, the puts (including the Paine 
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Fig. 3. Historical and implied NSA volatility 1989-1991. (——) Trailing 21-day historical volatility, 
(+) Osaka implied volatility. Implied volatility is the average of closest to money NSA puts and calls 
for the nearest month using the Gensaki as the interest rate. Source: Baring Securities. 


Webber and Salomon calls) are: 


Puts Calls 
a = SA + 
I. oo) BT-I, SEK, BTB, London OTC PWA 
e 
— NSA,\* 
I. p(t) BT-III, BT-IV, TFC, DXA, EXW, SXA, 
es yar A SXO, PXB Sal 
TI. Wee ee BT-II 
( Ey Ł, ) 


Each put warrant payoff function can be written as Lmax (strike-underlying, 0) Currency Units. 
That is L put options on the named underlying denominated in the specified currency unit. The 
actual payoff may be in different currency units converted at exercise or expiry at the exchange rate 
prevailing then. The warrants may be traded in different currency units. Neither of these affect the 
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Table 1. NSA classification of put and call warrants trading in Canada and the US in 1989-90. 


Warrant Payoff 


Canadian puts 


Bankers Trust-I(BT-D Cdn equivalent at rate then prevailing of yen 0.1168 (32174 — NSA)* 

Bankers Trust-H (BT-I) Cdn 0.1031 (270.54-NSA,/E,)* where E, is the number of yen per Canadian 
dollar at exercise 

Bankers Trust-III (BT-II]) Cdn $2.50/7.25 % (37 460.32 — NSA)*/37 460.32 = 0.0009205 (37460 — NSA)* 

Bankers Trust-IV (BT-IV) Cdn $2.50/7.25 % (29 843.34 — NSA)*/29 843.34 = 0.0011555 (29843.34 — NSA)t 


Trilon (TFC) Cdn $2.75/7 % (37 416.32 — NSA)* /37 416.32 = 0.0010487 (37416.32 — NSA)* 
SEK Cdn equivalent at rate then prevailing of yen 0.1168 (35963.74 — NSA)* 

US puts 

Kingdom of Denmark (DXA) USS 0.2 (37516.77 — NSA)*/145.33 = 0.0013762 (37 515.65 — NSA)” 
Salomon—I (SKA) USS 0.2 (36821.14 — NSA)*/145.52 = 0.0013744 (36821.14— NSA)* 

Bankers Trust (BTB) USS equivalent at rate then prevailing of yen 0.5 (37206.42 — NSA)* 
Salomon-II (SXO) USS 0.2 (37 471.99 — NSA)*/144.55 = 0.0013836 (37471.99 — NSA)* 

Paine Webber (PXB) USS 0.2 (29 246.06 — NSA)*/159.80 = 0.001252 (29 246.06 — NSA)* 

Salomon Warrant OTC, yen 1.0 (32806 — NSA)* 

London 


A/S Eskportfinans (EXW) USS 0.2 (29424.58 — NSA)*/158.84 = 0.0012591 (29424.58 — NSA)* 


US calls 
Salomon (SXZ) 1/15 (NSA — 28 442.94)* 1/158.8 = 0.00041982 (NSA — 28 442.94)* 
Paine Webber (PWA) US$ equivalent at rate then prevailing of 1/10 (NSA — 29 249.06)" 


underlying put option, but its pricing changes. Thus the fixed characteristics of each warrant 
namely: the leverage factor (L), the strike price (X), the name of the underlying (Under), and 
the currency unit of the underlying (CU) places each of the warrants in the standard form 
shown in Table 1. 


4. Fair valuation of NSA put and call warrants 


All of the warrants involve the NSA index and are American type and are valued in yen and 
may involve an exchange rate. Boyle’s (1988) generalization of the Cox-Ross-Rubinstein (1979) 
binomial lattice model was used to create 3—dimensional lattices to model the evolution of the 
NSA and the exchange rate and their interaction over time. In terms of calculation steps for n 
time steps, the CRR is of order n? and Boyle’s is of order n°. The value of the option is the expected 
present value of the option payoff in an economy in which the drift of a risky asset is the risk-free 
rate minus its dividend yield. The discount factor used to calculate the present value of the payoff is 
the risk-free rate. For the dividend yield, we use the foreign interest rate. The NSA dividend yield 
during 1989 was about 0.5%. While most of the dividends are paid in March and September, see 
Ziemba and Schwartz (1991), the continuous approximation used is accurate because the yield is so 
small. We ignore the typical one day or longer time lag between giving notice of exercise and 
actually being cashed out and other special provisions of the warrants including the credit risk 
of the issuer. We also ignore the possibility of predicting the mean return and use standard 
analysis; see Grundy (1991) and Lo and Wang (1995) for models based on return predictability. 
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4.1. Type I put warrants 


The exercise value is Yen L Max|(X — NSA,),0]. The currency plays no role. At expiry one 
converts yen immediately into dollars. Hence one has a standard put option which may be 
valued on a CRR lattice in yen. 


4.2. Type IT put warrants 


The exercise value is Yen L Max [(X — NSA,), O|E;. 

If the Cov (NSA, E) = 0 then the put is an American style on the NSA which may be valued on 
a CRR lattice.” Since the puts are American, lattice methods are required for accurate price 
evaluation. The interest rate used is that in the foreign pay currency. The dividend yield is replaced 
by the actual dividend yield on the index plus the interest rate differential. The currency risk may be 
hedged away by paying an extra dividend yield equal to the differential. By hedging into the foreign 
currency, the appropriate discount factor is the foreign risk free rate. 

Type II warrants are generalizations of type I warrants in that a type I warrant is a type II 
warrant for which the payout currency is yen. 


4.3. Type LIT put warrants 


The exercise value is Yen L Max [(XE, — NSA,,0]. These warrants are fundamentally different and 
only the Canadian BT-II is of this type. They have fair values above intrinsic even if the NSA has 
zero volatility and they may be valued as an option to exchange with a minor modification of 
Margrabe’s (1978) formula. Two risky assets a, b with values S,, Sẹ have 


payoff = Max (0, Sp — Sa) 
= S, Max (0,1 — S/S) 
= S, Max (0, S;,/S,— 1) 
Thus an option to exchange may be regarded as a put on the value of S, denominated in units of S$, 
or a call on the value of S, denominated in units of S,. Let o, and g, be the volatility of a and b, 
respectively, and p be the correlation of the log of the price relatives. Margrabe’s formula requires 


the dividend yield to be zero in the Black-Scholes put and call pricing. Using the notation: pricing 
formula (option type)(asset, X, T, ø, Div B, Div A) asset, 


BS put (S,/S;, 1, T, o, 0, 0) S, or equivalently BS call (S,/S,, 1. T, o, 0, 0) S,, where 
a = V (0} — 2pa ga + 07) 


? Data shows that Cov (NSA, $Cdn/¥) = 0 and Cov (NSA, $US/¥) = 0. If the Cov (NSA, E) # 0, then one may value 
these puts on a CRR lattice by adjusting owsa to oyga +P Onsaoe With the dividend yield equal to dys4 + rusa/can— 
"Japan + P ansa Te: Sec c.g. Reiner (1992) for proof. Thus a 3-dimensional lattice is not needed. 

+A closed form solution exists for this product option in the European case assuming log normal NSA and log normal 
currency changes since the product of lognormals is lognormal; see e.g. Merton (1973). This is developed specifically in 
Gruca and Ritchken (1991) and Clyman (1991, 1992). The latter author also develops arbitrage relationships updating the 
Merton (1973) analysis to this case. See also Reiner (1992), Dravid, Richardson and Sun (1993), and Kat and Roozen 
(1994). 


Table 2. Prices and implied volatilities of actively traded NSA puts and calls on the Toronto, American and London over-the-counter 
stock exchanges, 23 July 1990. 


18 % 


18% 


Ask Country Implied Vol. Relative Options Intrinsic 

Warrant Type price of issue Leverage Strike Expiry Currency volatility price cost delta value 

BT-I Put 2.530 CAD 0.1168 32174 17 Feb 92 JPY 23.9% $1.76 44% —0.40 $0.25 
BT-II Put 2.530 CAD 0.0011552 29843 15 Jun 92 CAD 26.4% $1.27 99% —0.24 $0.00 
BY-Il Put 6.500 CAD 0.0009203 37460 16 Mar93 CAD 29.3 % $5.15 26% —0.90 $5.12 
TFC Put 8.000 CAD 0.0010487 37460 22 Feb 93 CAD 32.7% $5.87 36% —0.90 $5.84 
SEK Put 4.000 CAD 0.1168 35964 16 Nov 92 JPY 18.3% $3.97 1% —0.71 $3.71 
DXA Put 10.000 US 0.0013762 37516 3 Jan 93 USD 29.9 % $7.78 29% —0.90 $7.74 
SXA Put 9.000 US 0.0013744 36822 19 Jan 93 USD 28.2% $6.94 30% -0.80 $6.77 
SXO Put 9.875 US 0.0013836 37472 16 Feb 93 USD 29.0 % $7.78 27% —0.88 $7.72 
BTB Put 20.500 US 0.5 37206 16 Jan 93 JPY 23.8% $18.16 13% —0.85 $17.93 
PXA Put 3.125 US 0.0012516 29249 8 Apr 93 USD 26.6 % $1.45 16% —0.22 $0.00 
PWA Call 5.500 US 0.1 29249 8 Apr 93 JPY 18.4% $5.47 1% 0.84 $1.79 
SXZ Call 3.500 US 0.0004198 28443 6 Apr 93 USD 15.4% $3.61 3% 0.86 $1.45 
Sal OTC Put 28.50 US l 35750 21 Feb 92 JPY 19.2% $27.62 3% —0.74 $26.03 
Sal OTC Put 2.000 US 1 31033 7 Aug 90 JPY 26.1% $0.90 122% —0.20 $0.00 
Sal OTC Put 9.25 US l 28 139 19 Jan 91 JPY 29.9 % $2.75 236% —0.14 $0.00 
Sal OTC Put 20.25 US 1 32 806 24 Apr 92 JPY 22.8 % $15.42 31% —0.44 $6.15 
Sal OTC Put 35.74 DM* 1 36969 26 Jun 91 JPY 22.7% $34.26 4% -0.99 $34.26 
Sal OTC Call 38.50 US l 28139 19 Jun 91 JPY 18.0 % $38.49 0% 0.87 $25.35 
SalOTC Call 21.50 US 0.5 29278 7 Apr 92 JPY 17.7% $21.60 0% 0.82 $8.83 


*The 36969 Sal OTC put traded in Deutchmarks but is valued in US dollars. 
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In a risk neutral economy the drift of S, and S, is r, the risk free rate, thus S,/S, has drift of zero. 
Margrabe’s formula can be extended to dividend paying assets with payoff ¥(S, — S4), where X is 
a constant as follows. Since S,/S, drifts at rate Div b — Div a where Div is dividend yield. The discount 
rate is Div b since the value is measured in units of S,. The value of an American style option to 
exchange S, for S, is then S,CRR (opt type, S,/S,, X, T, s, Div B, Div A). Hence the pricing is 


L EX,CCR (Put, NSA,/E,,X,T, 0,ig, Div NSA). 


The various trading prices, currency of issue, strike prices, leverage values, expiry dates, 
implied volatility, 18% volatility price, relative cost (% above or below 18%, volatility price), 
18% delta and intrinsic values for the various NSA put warrants are illustrated in Table 2 
for 23 July 1990. The basic data is NSA = 31895, Cdn$ = ¥ 148.13, with interest rates of 
11.75% Canadian, 7.361 % US and 7.319 % Japanese. 


5. Numerical comparison of warrant types I, II and IIT 


The three types of warrant puts may be compared as follows. Assume that the American puts have 
a two year exercise period, the home currency is normalized at 1, the NSA is 100, the Japanese 
interest rate is 6 %, the foreign (Canadian or US) interest rate is 10%, the NSA has a continuous 
yearly dividend of 0.5 % and the standard deviation of the NSA is 20 %. The relative values of the 
three types of puts vary with different assumptions on the volatility of the exchange rate and the 
covariance between the NSA and the exchange rate. Assume that the volatility of the exchange 
rate is 5, 10, or 20 % and that the covariance of the NSA and the exchange rate is —0.5, 0.0 or 0.5. 
Table 3 contains fair values for these warrants in terms of percentage of the NSA. 


Table 3°. Comparison of fair values of NSA put warrants. 


a. Type I Volatility of the exchange rate 
% 10% 20% 
—0.5 7.44 7.44 7.44 
Cov (NSA, E) 0.0 7.44 7.44 7.44 
0.5 7.44 7.44 7.44 

b. Type Il Volatility of the exchange rate 
5% 10% 20 % 
—0.5 7.36 7.57 8.00 
Cov (NSA, E) 0.0 7.13 7.13 7.13 
0.5 6.95 6.77 6.42 

c. Type IM Volatility of the exchange rate 
5% 10 % 20 % 
—0.5 7.03 8.65 12.51 
Cov (NSA, E) 0.0 6.02 6.78 9.51 
0.5 4.90 4.59 5.75 


$ The accuracy of these values depends upon computer implementation including the number of nodes in the binomial 
lattice. Eric Reiner in a private correspondence in 1993 found the following similar values using a 1024 node lattice with 
continuous rates of 0.06, 0.10, and 0.005 corresponding to annualized rates of 0.0161866, 0.1051709, and 0.005S0125: 


Type I: all 7.432 Type II: 7.091 8.717 12.575 
Type II: 7.367 7.575 8.011 6.061 6.841 9.568 
7.167 7.167 7.167 4.933 4.932 5.789 


6.973 6.786 6.431 
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When priced in yen, type I do not involve exchange rates, hence all warrant values are equal. 
With zero covariance between the NSA and exchange rates, the values of type II warrants are the 
same regardless of the volatility of the exchange rates. This value is less than that for type I 
warrants because of the positive differential between the foreign interest rate and the Japanese 
rate. However if the covariance is non-zero, then the value of type II warrants depends both on 
that covariance and the volatility of the exchange rate. In general, the value of the warrant 
increases with the volatility of the exchange rate and for put warrants, decreases as the correlation 
increases. Positive correlation means that negative returns on the NSA are associated with a 
strengthening of the yen. The investor receives returns if the NSA declines so if this is accompanied 
by a stronger yen, the payoff is less at exercise than otherwise would be received. 

Type IT] warrants have the most interesting behaviour. Their value depends on the volatility of 
the exchange rate even when the correlation is zero. In general, the higher the correlation, the lower 
the value of the warrant. With positive correlations, the value of the warrant for both low and high 
values of exchange rate volatility is higher than that for the intermediate. For typical observed 
parameters — covariance zero, exchange volatility about 10% and foreign interest rate above 
Japan’s - type I warrants are generally worth more than type II warrants which are in turn 
worth more than type III warrants, all other parameters (leverage, strike price, time to expiration, 
etc) being equal. There is a similar relationship between the type I (Paine Webber) and type II 
(Salomon) calls (see forward to Figs 9 and 10). 


6. Constructing the put warrant hedges and the convergence to 
efficiency 


The various exchange traded puts in Canada and the US and the over-the-counter puts traded in 
London had many common and several different characteristics that led to significant price differ- 
ences. Reasons for the price differences from fair values include currency and cross border risks, 
different credit risks, difficulties with borrowing for short sales, price effects due to the differing size 
of the warrants, differing strike values, inability to value the warrants properly, differing exercise 
provisions, market sentiment and volatility differences. The London over-the-counter market was 
active in 1988 and 1989 for large institutional investors. Prices were quoted by the market makers 
based on historical volatility (in the 15 % range) plus a profit margin. Salomon Brothers and to a 
lesser extent Bankers Trust, made the market with large bid ask spreads as shown below. On 24 
November 1989, the NSA was 36484 and three of the Salomon Brothers over-the-counter put 
warrants were priced as follows. 
Price in dollars 


Strike Expiry date Price in yen 24 Nov 1989 23 Oct 1990 
32 806 24 April 1992 ¥ 934-970 $6.50—6.75 $65.00 
31033 7 Aug 1990 503-575 3.25-3.75 Expired 
28,139 19 June 1991 75.5—76.5 0.40-0.45 $39.50 


By 23 October 1990, the puts had increased at least ten times and the third issuc nearly one 
hundred times. 
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The Canadian put warrants BT-I and BT-II were the first opportunity for non-institutional 
Canadian and US investors (three months after issue) to profit from a fall in the Japanese stock 
market. BT-I was issued in February 1989 and BT-II in June 1989. These warrants were very 
popular with investors and traded for very high premiums and implied volatilities. 

There was considerable good reason to believe that the Tokyo market was overvalued. See for 
example French and Poterba (1991) for one analysis based on adjusted price earnings ratios and 
Ziemba and Schwartz (1991) for a synthesis of various studies. One way to evaluate this is via Paul 
Aron’s adjusted price earnings ratios which are comparable to French and Poterba’s adjustments 
although somewhat lower. Aron (1981, 1989) computed these ratios to the end of August 1989. His 
values are shown in Table 4. His adjustments reflect different accounting and business practices, 
cross holding effects and different capitalization rates. Ziemba and Schwartz (1991) updated 
Aron’s adjusted values after August 1989 with assumptions concerning the earnings change of 
the NSA and capitalization rates. The values are shown in Table 4 up to 22 February 1991. The 
31 December 1989 value of 23.9 was the highest adjusted price earnings ratio at any time since 1949 
and pointed to extreme risk in the stock market. 

Despite its decline during 1990, it was not until the steep decline on 1 October that these values 
became cheap relative to historical price earnings ratios. Other stock market valuation models such 
as bond and stock yield differences, see Ziemba and Schwartz (1991), also were at historical high 
values at the end of 1989°. All of these models are driven by two factors: earnings forecasts and 
interest rates. The extreme increase in interest rates in 1989 from a 2.5% discount rate to 5.25% 
and later to 6.0% was at the heart of the estimated overvaluation. 

In a multivariate factor model regression study for the period 1979-1989, Ziemba (1989) found 
that future earnings forecasts were by far the most important variable for predicting the rates of 
return of Japanese stocks. 

Hence there was considerable reason for investors to believe that the Japanese market would 
crash or at least decline sharply. Since the Canadian puts were the only product available to invest 
in this belief, their prices were understandably very high, particularly given that it was difficult for 
nearly all of the purchasers of these puts to fairly value them. Seasonality observers also noted that 
the decline in January 1990, while only 4.5 %, was a key negative signal since January has histori- 
cally provided the highest returns in the Japanese markets, see Ziemba (1991b). Moreover, the 
conditional probability of a decline in the rest of the year following a decline in January is quite 
high; see Hensel and Ziemba (1995). 

Table 3 shows that the fair values of type I warrants generally exceed those of type IT which in 
turn exceed those of type III. The BT-I is a type I and the BT-II a type III. In terms of premium, 
see Table 5, the BT-I was priced higher than the BT-II. This was the case in the entire trading 
period from September 1989 to the eventual collapse in February 1990. Table 5 provides insight 
into pricing differences for Canadian—US hedges (A) and US hedges (B), but those provided from 
theoretical option pricing models, as we now discuss, were used in our analysis. For example, 
shorting of BT-I or BT-II and buying BT—US looks like a potentially profitable cross border 


é Whenever the difference was more than two standard deviations above the mean, the market was said to be in the “danger 
zone”. This model was applied throughout the 1949-1989 period. During this time there were twenty declines of 10% or 
more. Every time the markct was in the danger zone during this 40 year period, a decline of 10% or more occurred. There 
were declines of 10% or more without the difference being the “danger zone”. 
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Table 4. Paul Aron’s adjusted PERs for Japan compared with those in the US, 26 April 1981 to 31 
August 1989, with adjustments for later periods to 22 February 1991 by Ziemba and Schwartz (1991). 


Date US PER Japan, adj PER NSA 
26 Apr 1981 7548 
19 Oct 1984 10929 
17 Apr 1986 15827 
26 May 1987 24533 
11 Sept 1987 24829 
31 Dec 1987 21 533 
31 May 1988 26963 
30 Aug 1988 27679 
31 Aug 1989 34808 
31 Dec 1989 38915 
30 Mar 1990 29980 
22 June 1990 31 694 expensive | 
30 Sept 1990 20983 aera 
1 Oct 1990 20022 
2 Oct 1990 22896 
31 Dec 1990 23849 
22 Feb 1991 25903 
Values after August 1989 assume: 

Interest 

Earnings gain 

Date US Japan over Aug 89 
Dec 1989 8.2 6.4 5% 
Mar 1990 8.4 7.4 10% 
June 1990 8.2 7.0 12% 
Sept 1990 8.2 7.75 8% 
1 Oct 1990 8.1 7.5 8% 
2 Oct 1990 8.1 7.5 8% 
31 Dee 1990 7.6 6.5 8% 
22 Feb 1991 7.6 6.0 8% 


hedge trade. Similarly shorting Kingdom of Denmark or Salomon 1 and buying BT-I looks like 
a potentially profitable hedge trade on the American stock exchange. Figures 4 and 5 show the 
theoretical pricing in two ways. Implied volatilities appear in Fig. 4. They illustrate the point. 
However, implied volatilities did not exist at many dates in 1990 when the puts were trading at 
discounts (as discussed later in the paper); see the vertical lines in Fig. 4. Hence, a preferable way to 
compare the warrants prices is by their relative cost. That is actual cost minus theoretical value as a 
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Table 5. Comparison of prices and premium values for four Canadian and three US NSA put warrants on 1 
February 1990. 


% of Hedging actions 
NSA Expiry Years to Premium Premium — 
Warrant Price unit date expiry % per year % A B 
BT-1 $2.70cdn 11.68 % 2/17/92 2.05 20.1 9.8 SELL 
BT-II $1.93cdn 10.31 % 6/15/92 PAA 16.4 6.9 SELL 
BT-HI $2.50cdn 14.29 % 2/16/93 3.05 7.0 2.3 
Trilon Finl $2.75cdn 13.7% 2/22/93 3.05 7.25 2.4 
K of Denmark $5.63us 20% 1/3/93 2.93 10.1 3.4 SELL 
Salomon-I $4.63us 20% 1/19/93 2.97 10.1 3.4 SELL 
BT-US $9.17us 50% 1/16/93 3.00 8.0 2.6 BUY BUY 


percentage of theoretical value. This is shown in Fig. 5 assuming a volatility for the NSA of 20 % 
and an exchange rate volatility of 10%. 

There were no NSA put warrants trading in the United States until the Kingdom of Denmark 
(type II) put warrant began trading on the American Stock Exchange on 3 January 1990. The 
Salomon A (type II) and Bankers Trust (type I) put warrants began trading two weeks later. 
With the availability of these three warrants investors in the Canadian put warrants could 
replace these warrants with the much cheaper US instruments. Figures 4 and 5 show that it 
took more than a month for the Canadian puts to converge to efficiency. A gradual decline 
began with the introduction and market knowledge of the three cheaper US instruments and 
then there was a sudden collapse in late February 1990 just after the second Salomon put 
warrant (a type II) began trading. The slowness of the market to react to new information is analo- 
gous to that of stock prices which are frequently slow to react to new earnings information, see 
Affleck-Graves and Mendenhall (1992), Bernard and Thomas (1990, 1992) and Aberbanell and 
Bernard (1992). The market needed time to understand, evaluate and then fairly price these 
complex instruments. 

The collapse occurred at a time of minor decline in the NSA in February 1990 well before the 
steep declines in March and April 1990. Hedge investors who were able to short the Canadian put 
warrants and buy cheaper US warrants particularly the Bankers Trust BTB could have made 
considerable profits’. 

A second advantageous hedge is illustrated in Figure 6. Despite the fact that the theoretical fair 
values of type I warrants was larger than type II, US investors had a preference for type IT instru- 
ments. Apparently they preferred a fixed exchange rate in dollars upon expiry rather than to value 
the puts in yen. To eliminate the currency risk, they paid more for type II warrants than if they had 
bought type I warrants and hedged the currency risk in the futures market. Hence type I] warrants 
traded for prices which were much larger than those of the type I warrants. There was also a price 


7 This hedge had relatively low risk but was not a truc arbitrage given that there were different credit risks and other 
characteristics of the various put warrants. There was also the difficulty of securing and holding borrowed warrant short 
positions. The threat of forced buy-in was also present. All three authors did have forced buy-ins of short positions, but the 
amount was small so that the overall hedge was very successful. 
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Fig. 4. Implied volatility of BT-I, BT-II, and BTB NSA put warrants assuming exchange rate volatility of 


10 %, 17 February 1989 to 21 September 1990. (A) BT-I, type 1, Canadian, (+) BT-II, type 3, Canadian, and 
(©) BTB, type 1, US. 


effect. The BTB warrant represented 0.5 of an NSA unit and the DXA, SXA and SXOs were worth 
only 0.2 of an NSA. Hence the BTB should trade, other things being equal, at about 2.5 times plus 
or minus a transactions cost band around the other warrants. In fact the BTB usually traded at 
prices much lower. This is analogous to the low priced stock effect that captures much of the 
January small firm effect, see Blume and Stambaugh (1983). These two factors yielded the profit- 
able hedge from January to March 1990. After convergence to efficiency these markets have since 
generally traded within transactions costs bands. 

Figure 7 shows the relative costs of various put warrants in Canada and the US with similar 
strike prices. These warrants were all issued in early 1990 and had NSA strikes between 36 822 and 


8 Additional analysis of the post March 1990 period for various US NSA put warrants , particularly of type IT, appears in 
Clyman (1991, 1992), Chen, Sears and Shahrokhi (1992) and Dravid, Richardson and Sun (1993) among others. Generally 
speaking, after transactions costs are considered, the market was efficient. 
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Rel Cost @ Nik Vol 20%, Exch Vol 10% 
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Fig. 5. Relative costs of BT I, BT II] and BTB NSA put warrants with NSA volatility of 20 % and exchange 
rate volatility of 10%, 17 February 1989 to 21 September 1990. Relative deviation from model price = 
(actual cost — theoretical value)/(theorctical value). (+) BT-I, type 1, Canadian, (©) BT-II, type 3, 
Canadian and (A) BTB, type 1, US. 


37472. Here are shown the higher prices paid for type II warrants in comparison to type I from 
January to the end of February 1990. 

Figure 8 shows the relative costs of Canadian type I, If and HI NSA put warrants. Investors, 
relative to fair prices paid more for type I (BT-I) than for type III (BT-II) until the market 
converged to efficiency in late February 1990. From March to September 1990, ail three types 
of put warrants had relative costs within transaction cost bands. Figures 9 and 10 give the 
implied volatilities and relative costs of the two NSA call warrants traded on the American 
Stock Exchange. The Paine Webber call is a type I and the Salomon is a type H. The fair value 
of a type I should be higher than a type II. However, US investors preferred the type LI with its 
fixed exchange rate of dollars into yen and bid its price higher during most periods from April to 
October 1990. 
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Fig. 6. Relative cost of US type I (BTB) versus US type I] (DXA, SXA, SXO) NSA put warrants, January to 
September 1990, assuming NSA volatility of 20 %. (C1) BTB, type 1, 0.5 NSA, (+) avg DXA, SXA, SXO, type 
2, 0.2 NSA, and (—) normalized Nikkei. 


Table 6. Percent of days the intrinsic value exceeded the market value (Source: Clyman, 1991). 


Month DXA SXA SXO PXB EXW 


Jan 0.0 0.0 na na na 

Feb 0.0 0.0 0.0 na na 

Mar 18.2 22.7 27.3 na na 

Apr 65.0) 70.0 $5.0 0.0 0.0 
May 40.9 40.9 40.9 0.0 0.0 
June 0.0 0.0 0.0 0.0 0.0 
July 0.0 0.0 0.0 0.0 0.0 
Aug 43.5 43.5 43.5 0.0 0.0 
Sep 57.9 52.6 52.6 5.3 0.0 
Oct 52.2 47.8 52.2 4.3 4.3 
Average 29.1 29.5 35.8 1.4 0.8 


Exercise Price 37517 36821 37472 29249 28425 
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Fig. 7. Relative costs puts with similar strike prices, assuming 20% NSA volatility, Yen/Canadian dollars 
volatility of 10%, January to September 1990. (C) BT III, (+) TFC, (©) DXA, (A) SXA, (x) SXO, and 
(V) BTB. 


7. The relationship between NSA put warrant prices in North America 
and the cash market in Tokyo 


During much of 1990 the Toronto and New York NSA put warrants were trading deep in the 
money. Frequently the puts traded for less than their intrinsic value, see Table 6. Tokyo's next 
trading session was the following day. Since much futures hedging was required to protect the 
issuers’ positions, and that trading would lead to index arbitrage if the futures prices deviated 
much from fair value, the prices in North America provided a forecast of the likely prices in 
Tokyo. If the put was trading at a discount one would expect the Tokyo market to rise. Similarly, 
the forecast was for a fall in the Tokyo market if the put was trading at a premium. An indication 
of the size of the market is that during 1990 the NSA puts averaged 13 % of the trading volume 
(and a similar fraction of the trading value) on the American Stock Exchange. Informal estimates 
by the authors of the size of the market in the US and Canada suggests that it was possible that 
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Fig. 8. Relative costs of Canadian type I, II and III put warrants based on 20 % NSA volatility and 10 % Yen/ 
Canadian dollar volatility, January to September 1990. (+) type II, avg BT. UI, TFC, SEK no Volume, 
(©) type I, BT-I, (A) type III, BT-II and (V) normalized Nikkei. 


upwards of 20% of the NSA futures trades in Osaka and Singapore were related to NSA put 
hedging. Consider for example, the SXA Salomon January 1993 NSA put. It had a strike price 
of 36821.14, a currency conversion rate of 145.52 yen per US dollar, and is worth 0.2 of an NSA 
unit. The intrinsic value of the put was 


0.2 (36821.14 — NSA) 
145.52 $ 
Hence the implied NSA is 36821.14 — 5 (145.52)P. Table 7 shows that on the seventeen of the 
twenty-five trading days from 1 August to 6 September 1990, the forecast was correct (six of the 
incorrect predictions were expected rises). 
There are many control aspects to a full study of this relationship such as open versus closed 
prices, futures effects, are futures and warrants giving the same estimate of cash prices, etc? 


? For some progress on this see Dravid, Richardson and Craig (1993) and Yuen (1994). 
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Fig. 9. Implied volatility of the Paine Webber and Salomon NSA call warrants, April to October 1990. (D) 
PXA, and (+) SXZ. 


However, there seems to have been a strong relationship between the price of the NSA put 
warrants in North America and cash prices in Tokyo on the next trading day. 


8. Implications of the findings and concluding remarks 


The paper has described two favourable hedges involving Nikkei put warrants during the period 
November 1989 to February 1990. The cross border hedge involved shorting overpriced Canadian 
Nikkei put warrants which traded on the Toronto Stock Exchange and purchasing either Nikkei 
puts with negotiated terms over the counter in London or exchange traded puts on the American 
Stock Exchange. Since the Canadian puts were unavailable to US investors for three months from 
their issue in February and April 1989 they were not heavily advertised or known in the United 
States. US residents and citizens could have traded them at the time of the hedge, however. The 
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Fig. 10. Relative costs of the Paine Webber and Salomon NSA call warrants using 20% NSA historical 
volatility, April to October 1990. (0) PXA, (+) SXZ, and (—) normalized Nikkei. 


reasons for the mispricing are several. The puts were complex for most ordinary investors and ail 
but experienced option traders in Canada likely evaluated them incorrectly. Evidence of this is 
found in the literature on them from various Canadian brokerage houses. Many investors in 
Canada and academics — see for example, French and Poterba (1991) and Ueda (1990) — were 
quite convinced that the Japanese market was overpriced. Even the Canadian investors bidding 
up of the price did not prevent them from making considerable profits later when the Nikkei fell 
sharply". The Canadian puts finally declined into their theoretically correct pricing about a month 
after the US puts were trading on the American Stock Exchange. 

The studies of Bernard and Thomas (1989, 1990), Affleck-Graves and Mendenhall (1992) and 
Aberbanell and Bernard (1992) show the slowness of individual stocks to react to new earnings 
information. Hence, it is not surprising that this convergence to efficiency of more complex cross 
border investments would take about a month to occur. 


1 According to Slocum (1993) investors in the four Bankers Trust warrants made a total profit of about Cdn$500 million. 


Chapter 7: Risk Arbitrage in the Nikkei Put Warrant Market of 1989-1990 205 


Risk arbitrage in Nikkei put warrant market 267 


Table 7. The Salomon Nikkei January 1993 put warrants record at predicting the following 
day’s change in the NSA in Tokyo, | August to 6 September 1990. 


Date SXA Implied Nikkei Nikkei close Prediction* 
8/01 9.375 30 000 30838 fall 

8/02 10.50 29181 30245 fall 

8/03 10.875 28 908 29516 fall 

8/06 12.625 27635 28 600 fall 

8/07 12.00 28 090 27 653 rise 

8/08 11.125 28 726 28 509 rise x 
8/09 11.75 28272 27616 rise X 
8/10 12.875 27453 27 330 rise X 
8/13 13.75 26817 26176 rise 

8/14 12.875 27 453 26673 rise 

8/15 12.375 27817 28112 fall 

8/16 13.875 26726 27 549 fall 

8/17 14.125 26 544 26 787 fall 

8/20 13.50 26 998 26 490 rise x 
8/21 14.75 26089 26 298 fall 

8/22 16.00 25179 25211 fall 

8/23 18.125 23 633 23 738 fall X 
8/24 16.625 24725 24166 rise 

8/27 14.625 26 180 25142 rise 

8/28 15.50 25 543 25711 fall 

8/29 16.00 25179 24 895 rise 

8/30 16.00 25179 25 670 fall X 
8/31 15.625 25452 25978 fall 

9/03 closed na 25420 

9/04 16.25 24997 24908 risc x 
9/05 17.00 24 442 24078 rise x 
9/06 17.75 23 906 23812 


* The prediction is for a fall (risc) in the Nikkei on day ¢ + 1 if the implied Nikkei on day ¢ is below (above) 
the close on ¢. 
Source: Modified from The Wali Street Journal (1990). 


Interestingly, Bankers Trust also issued Canadian dollar against the US dollar put warrants in 
June 1990. These traded on the Toronto Stock Exchange at a time when many Canadians expected 
a sharp decline in the Canadian dollar while US exchange traded options on the Canadian dollar 
were actively traded. These puts were also overpriced and they stayed overpriced for the entire year 
until they and the US puts expired worthless in June 1991. This latter case has some parallels with 
the Nikkei put hedge but important differences. 

With the Canadian dollar puts the difference in price could be explained by the fact that it was 
extremely difficult to short these puts!!. For those that did, including two of the authors, there were 
considerable profits in a percentage but not absolute basis. In contrast it was not difficult to short 
the Canadian Nikkei puts in large numbers. Salomon Brothers and other issuers were in the posi- 
tion to participate in the mispricing hedge. Presumably for business reasons concerned with selling 
such products, they did not converge the mispricing to efficiency sooner than February 1990. The 
market did price the relative values of the type IT and type ITI Canadian puts correctly. Other 


1 Another example in Holland, with similar mispricings related to the inability to short the overpriced warrants, is discussed 
by Veld and Verboven (1992). 
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reasons for the temporary mispricing of the Canadian puts as discussed in the text are the risks of 
buy-ins, cross border risks, small relative currency risks and differing credit risks. The latter is 
mitigated somewhat in the hedge: sell Canadian Bankers Trust puts and buy US Bankers Trust 
puts. Still if an extraordinary event occurred and there was no trading in the Nikkei index the 
liquidity of the two types of puts could have been different. 

The second hedge involved securities of fixed versus floating exchange rate on the American 
Stock Exchange. The explanation for this mispricing which also lasted about one month seems to 
be the price effect and the different ways one can view the currency risk and pricing. The price effect 
where the Bankers Trust US warrants had NSA sizes two and half times as large as the Kingdom of 
Denmark and the Salomon puts is totally analogous to the effect of low priced stocks in January 
studied by among others Blume and Stambaugh (1983). It is known that much of the January small 
firm effect can be equally viewed as a low price effect. Hence, it is not surprising that in the very 
beginning of their trading the much higher nominally priced BT warrants traded for somewhat 
lower actual prices. Another possible reason for the discrepancy involves the currency risk. The 
theoretical models assume that currency prices are based on their forward rates. Hence, if investors 
were assuming that the lower yielding yen would not appreciate against the higher yielding US or 
Canadian dollars as evidence summarized by Froot and Thaler (1990) suggests for such currencies, 
then higher prices were warranted for the fixed exchange rate puts'”. Since even this explanation, 
that is, assuming that the forward rate equals the spot is not enough to explain the full extent of the 
mispricings it appears that a combination of the two effects and the premium that is warranted for 
eliminating the currency risk is the logical explanation. 
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Chapter 8 


DESIGN OF ANOMALIES FUNDS: 
CONCEPTS AND EXPERIENCE 


Dennis R. Capozza and William Ziemba 


INTRODUCTION 


During recent years, many researchers and practitioners have hunted for 
deviations from market efficiency in a quest for excess returns. The big game 
in this hunt is a bona fide “anomaly.” The hunters have flushed out some 
veritable “beasts” that are disconcerting to the more doctrinaire efficient market 
advocates but heartening to the antagonists. However, if anomalies are to be 
more than elusive and ephemeral prey, a strategy must be developed that 
realizes an excess return. Implementation of such an anomalies strategy is the 
subject of this paper. 

In this context, market efficiency is usually defined in terms of the capital 
asset pricing model (CAPM). The simple version of the CAPM can be written 


(rs) = rp + Bl Elm) — ry) 
where E(r,) is the expected return on a risky asset 
ry is the yield on a riskless asset 
Erm) is the expected return on the market 


Bs ìs the relative risk measure for the risky asset. 


As suggested by many writers, if the model is correct and security markets 
are efficient, security returns will, on average, conform to the above relation. 
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Persistent departures from the CAPM violate the joint hypothesis that both 
the CAPM and the efficient market hypothesis are correct. “Anomalies,” or 
security market regularities are departures from the joint hypothesis. While 
CAPM is the dominant asset pricing model against which efficiency has been 
tested and anomalies observed, these anomalies appear to be pervasive even 
when other valuation theories such as APT are used (see Giiltekin and Giiltekin 
1987; Cho and Taylor 1987). 

In the next section we describe ways to structure an anomalies fund. In the 
third section we survey some exploitable anomalies. The fourth section outlines 
the experience of two anomalies funds. The fifth discusses problems of 
implementation and the final section concludes. 


WHAT IS AN ANOMALIES FUND? 


If a true anomaly exists it should be possible to construct a portfolio of 
securities that will earn an excess return with little or no risk. For example, 
suppose low P/E stocks outperform high P/E stocks on average. Then a 
portfolio that is long low P/E securities and short high P/E securities in the 
right proportion should earn the excess return with little or no systematic risk. 
In practice it may be difficult to eliminate all risk with offsetting short positions, 
but in theory it is possible to do so. If a portfolio is constructed in this low- 
risk manner we have a “pure” anomalies fund. A pure anomalies fund should 
have an excess return with little or no systematic risk. 

To create a portfolio with a higher expected return, one need only reduce 
the short positions in the portfolio. This portfolio with only long positions 
could be constructed with an overweighting of “anomalous” securities. The 
portfolio will be risky; but if the stocks are bona fide anomalies, the returns 
on this portfolio should exceed those on the market over a sufficiently long 
horizon. The portfolio will have risk characteristics similar to the market 
portfolio unless the anomaly stocks (e.g., low P/E) are exceptionally risky.' 
If portfolios are constructed in these risky ways, the portfolio is an anomalies 
“growth” fund. 

Most fund managers are, in effect, following a strategy similar to this latter 
concept. In this paper, however, we reserve the term “anomalies fund” for 
portfolio strategies based on the anomalies which have been documented in 
the academic literature to have statistically significant excess returns. 


EXPLOITABLE ANOMALIES 


While a wide variety of phenomena has been proposed as possible exploitable 
anomalies. the best documented anomalies include those associated with 
seasonality (January, monthly, weekly, and holiday). insider trading, 
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unexpected earnings, and closed-end mutual funds. Other anomalous behavior. 
such as during tender offer repurchase of shares (Lakonishok and Vermaelen 
1988), are not explored here. Surveys of equity return anomalies are available 
in Jacobs and Levy (1988) and Keim (1986). 


Seasonality -January 


The abnormal return at the beginning of the year, the “January effect,” has 
reccived a great deal of attention in the academic literature. Because of its size, 
persistence, and pervasiveness, the January effect is the “beast” among the 
anomalies. For small firms 40% of the return for the year occurs in January 
(Rogalski and Tinic 1986). The same seasonal pattern appears in stock markets 
around the world (Haugen and Lakonishok 1988; Gültekin and Gültekin 1983). 

In attempting to explain the January effect, researchers have found it to be 
more pronounced for small firms (Keim 1983a:; Reinganum 1983; Roll 1983; 
Ritter 1988), for low P/E stocks (Jaffe, Keim, and Westerfield 1989), and for 
stocks that have declined sharply (DeBondt and Thaler 1985, 1987). When 
combined with these other anomalies, the January seasonal is large enough 
to support active portfolio strategies. That is, since the excess return exceeds 
typical transactions costs, an investor should be able to buy stocks in December 
for liquidation at the end of January profitably. Lakonishok and Smidt (1984), 
however, caution that the small-firm effect in January arises from errors in 
variables due to non-trading of small-firm stocks at the turn of the year. The 
use of the bid-ask spread mean for price by CRSP (Center for Research in 
Security Prices) and the regulation of market-makers may lead to biased results 
in empirical tests. However, even these authors do find a significant abnormal 
return for small firms, before brokerage, at the end of the calendar year. 

This active strategy contrasts with the “virtual” strategy that must be used 
with many of the smaller anomalies. When transactions costs exceed the 
anomaly'’s excess return, for example, with the “Monday effect” discussed 
below, an investor cannot profit from an active strategy of buying and selling. 
Instead, the anomaly can help to displace trades in time to favorable moments. 
For the Monday effect the investor would delay purchases until late Monday 
and advance sales to the preceding Friday close. 

Many active strategies can be devised to exploit the January effect. These 
include buying stock of small firms with low P/E ratios in December, buying 
firms with the largest price declines, and buying Value Line futures while 
shorting the S&P futures (Clark and Ziemba 1987). This last strategy would 
be suitable for a pure anomalies fund while the others are suitable for an 
anomalies growth fund. A pure anomalies fund could also short large firms 
with high P/E ratios, index futures, or options. This will eliminate the risk 
in the long positions of the small firm, low P/E strategy. 
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Seasonality-Monthly, Weekly, and Holiday Seasonals 


Other well-documented seasonal anomalies include the monthly, weekly, and 
holiday effects. Ariel (1985) found that returns in the first half of the month 
are significantly larger than returns in the second half (except in February). 
The first nine trading days have an average return of 1.4%, while the last nine 
average —.02%. 

French (1980), Gibbons and Hess (1981) and Keim and Stambaugh (1984) 
find a small but significant negative average return on Monday of about —.14%. 
Ariel (1987) reports a seasonal around legal holidays. 

None of these is large enough for the average investor to exploit via an active 
strategy, although the monthly seasonal may be exploitable for low-cost 
traders. Therefore the virtual strategy outlined above must be employed. Stock 
purchases can be timed to coincide with the beginning of the favorable period 
and stock sales can be timed for the end of favorable periods. 


Insider Trading 


Insider buying and selling signal the future price movements of a stock for 
up to six months following the insider trades (Seyhun 1986, 1988). This 
anomaly is small but can be combined with other anomalies such as the small- 
firm, low P/E, and overreaction effects of January. A pure anomalies fund 
would buy firms with net insider buying and sell short firms with net insider 
selling. An anomalies growth fund would simply overweight the portfolio with 
insider buying securities and underweight with insider selling securitics. 


Closed end Funds 


Thompson (1978) has found that buying closed-end funds that trade at a 
large discount to net asset value (e.g., greater than 20%) provides excess returns. 
This anomaly is both small and risky but can be combined with other anomalies 
to yield a tradeabie strategy. A pure anomaly fund would buy closed-end funds 
at large discounts and short funds at small discounts. 


Unexpected Earnings 


Foster, Olsen, and Shevlin (1984) and Jacobs and Levy (1988) find persistent 
price behavior around the announcement of unexpected earnings. The excess 
returns persist for six months following the announcement. Bernard and 
Thomas (1989) argue that most of the drift following earnings announcements 
takes place around the subsequent quarter’s earnings announcement. That is, 
it appears that the market fails to appreciate fully the implication of one 
quarter’s earnings for the next quarter’s earnings. Strategies similar to those 
outlined above can be used to exploit this anomaly. 

A summary of some of these findings appears in Table 1. 
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Type 


217 


A Partial List of Findings on Stock Market Anomalies 


Authors 


|. Small firm and P/E effect 


a) Small firm 


b) P/E Effect 


IL Seasonality 


a) Monthly 


b) Weekly 


Ht. Insider 
Trading 


IV. Closed End 
Funds 


V. Unexpected 
Earnings 


Keim (1983) 


Reinganum (!983) 


Roli (1983) 


Rogalski and Tinic 
(1986) 


Ritter (1988) 


Jaffe, Keim, 
Westerfield (1989) 


Ariel (1987) 


French (1980) 


Gibbons & Hess 
(1981) 


Keim & Stambaugh 
(1984) 


Seyhun (1986) 


Thompson (1978) 


Foster, Olsen and 
Shevlin (1984) 


Bernard and 
Thomas (1989) 


Finding 


Annualized abnormal returns difference of 30.3% 
between small and large firms. Difference is 15.4% 
in all months excluding January (1963-1979). 


Negative correction between size and abnormal 


returns,e even after adjusting for tax loss selling 
(1963-1979). 


January returns primarily on last day of December 
and first four trading days of January. 6.89% for 
NYSE, 14.2% for Amex: 5 day return (1963-1980). 


EW market portfolio earns average daily return of 
0.34% in January—which ts at least 4 times larger 
than other months (1963-1982). 


Mean difference between small and large firm 
returns on first 9 days of January is 0.00876% 
(1970-1985). 


Moving from lowest to highest quintile of E/P 
ranked firms increases returns by 3.2% annually. 


Cumulative return of 1.4% on first 9 days of 
month, —0.02% return on last 9 days of month 
(1963-1981) 


Mean Monday returns = —. 166 (1953-1977) 
Mean S&P 500 Monday returns = —.13% (1962- 
1978) 

Mean Monday returns = —.199% (1928-1982) 


1.1% abnormal return following insider trade 100 
days after reporting (1975-1981) 


Average abnormal return in excess of 4% per year 
on trading on closed end funds trading at a dis- 
count (1940-1975). 


Sign and magnitude of earnings forecast error 
explains (81% of) post announcement drift in stock 
returns. 


5% difference between excess returns of high and 
low decile SUE firms over 60 days following unex- 
pected earnings. 
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EXPERIENCE 


The ultimate test of an anomaly is whether strategies based on the anomaly 
yield excess returns. Unlike a statistical study, a portfolio manager must deal 
with commissions, bid/ask spreads, unanticipated events, power and telephone 
outages, human emotions, “fast markets,” and other practical considerations 
that cannot be fully accounted for in a study. In this section we describe the 
results of an attempt to follow an anomalies strategy by the Canadian Asset 
Research Institute (CARI). 

Canadian Asset Research Institute has managed both a pure anomalies fund 
and an anomalies growth fund on an experimental basis since early in 1987. The 
CARI Anomalies Fund attempts to exploit the anomalies described above by 
buying securities with the indicated characteristics and selling index futures or 
options to reduce the systematic risk. During seasonal periods that are particularly 
favorable (e.g., early January) the fund may leave the long position unhedged 
or only partly hedged by short futures and options. 

The success of the fund at converting the anomalies into excess returns at 
low risk can be assessed from Table 2. 

As can be seen in the table the first year of operation, which includes 
the October crash, was not very successful for this fund. The excess return 
is negative and the risk level is moderate with the beta at .65 (65% of the 
risk of the market index). The second year on the other hand, is much more 
successful. The excess return is about 1% per month while the risk is very 
low with beta equal to .15. Unsystematic risk also declines in the second 
year. 

The CARI Growth Fund attempts to exploit the same anomalies as the 
CARI Anomalies Fund but leaves the longs unhedged or less hedged. Figure 
1 below summarizes the experience of this fund relative to the S&P 500. As 
with the Anomalies Fund, the CARI Growth Fund is more successful at 
achieving its aims in 1988 than in 1987. 


Table 2. CARI Anomalies Fund Risk and 
Return (percent per month) 
(rs: = Gs + Bstm + Est) 


1987 1988 
alpha —.311% 1.45% 
beta 0.65 0.15 


unsys. risk 4 1% 3.3% 


617 


na 


Net Asset Value 
Monthly Data 1987/88 
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Figure 1. CARI Growth Fund and S&P 500, 1987-88 
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PROBLEMS OF IMPLEMENTATION 


The most difficult problems associated with the anomalies strategies arise in 
hedging positions to reduce risk. As a result, the pure anomalies concept is 
more difficult to implement than the anomalies growth concept. 

To maintain a proper neutral hedged position, both the risk in the long 
position and the risk in the hedging instrument (stocks, stock index futures, 
or stock index options) must be monitored. Since the risk can change both 
with the portfolio positions and with market fluctuations, there is considerable 
opportunity for both errors and transactions costs to rise. Events like the 
October 1987 meltdown tend to greatly increase total trading costs 
(commissions and execution expenses). For the CARI Funds many problems 
with hedging the positions arose in the first year—especially as a result of the 
October 1987 crash. 

When anomalies are exploited by taking short positions in stocks, it is 
important to be able to earn full interest or invest the proceeds on the shorts. 
The excess returns in most anomalies is small so that the loss of interest can 
eliminate the excess return. Usually it is not possible to obtain full interest 
credit. Consequently, alternative positions in index futures or index options 
where interest is credited tend to be more attractive. 

Many of the anomalies involve positions in small firms. These securities tend 
to be thinly traded with large (in percentage terms) bid-ask spreads (Stoll and 
Whaley 1983). As a result, great care must be taken in the execution of trades. 
Large orders will drive prices up by more than the small excess return predicted 
by the anomaly and eliminate any expected gains. Low priced securities are 
particularly vulnerable since bid/ask spreads on these securities can exceed 10% 
of the market price. 


CONCLUSION 


Recent research has uncovered anomalies that violate the joint hypothesis of 
market efficiency and the CAPM. Experience suggests that the anomalies 
research can help to construct portfolios that earn excess returns. The most 
significant of these anomalies is the January effect which is intertwined with 
other anomalies. Only time will tell whether these anomalies will persist in the 
face of an onslaught of portfolio managers attempting to exploit them. 
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Chapter 9 


Land and Stock Prices in Japan 


Douglas Stone and William T. Ziemba 


trillion. This was more than 20 percent of the world’s wealth, or to put it 

in some other contexts, about double the world’s equity markets or half 
again as large as the world’s bond markets. Japanese land was then valued at 
about five times that of the United States; the land under the Emperor’s Palace, 
which is about three-quarters of a square mile, was estimated to be worth about 
the same as all the land in California or in Canada. Real estate assets of 
Japanese corporations grew by $2.8 trillion from 1986 to 1988, an increase in 


L: late 1991, the total land value in Japan was estimated at nearly $20 


valuation roughly equal to the size of the Japanese gross national product. 

An equally dramatic rise in stock prices accompanied the rise in land 
prices. At its peak in December 1989, the Japanese stock market had a value of 
about $4 trillion, which was about 44 percent of the world’s equity market 
capitalization. To put that figure in perspective, the value of the equity on all 
the stock exchanges in the United States in August 1992 was less than $5 
trillion. But then, from its peak in December 1989 to August 1992, the 
Japanese stock market fell by over 60 percent. Various indices of speculative 
land values fell a similar amount. Meanwhile, other land prices—industrial, 
commercial, residential, as measured by various indices—fell 15-20 percent. 

This paper discusses the rise of Japanese stock and land prices in the past 
four decades and their dramatic decline in the early 1990s. To what extent can 


m Douglas Stone is a member of the Research Department of the Frank Russell 
Company, Tacoma, Washington. William T. Ziemba is the Alumni Professor of Manage- 
ment Science, University of British Columbia, Vancouver, Canada. 
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fundamental factors explain both the price levels and the returns from land 
and stock in Japan? Are land prices driving stock prices, or the other way 
around, or are stil] other factors affecting both? How has government policy 
interacted with the price changes? In practice, it is very difficult to solve the 
problem of separating the explanation that a bubble occurred from the possibil- 
ity that the underlying fundamental problem is misspecified, a difficulty ex- 
plained by Flood and Hodrick (1990). We believe that the bulk of the rise in 
Japanese asset prices from 1985-89 and the decline during 1990-92 was 
driven by interest rate and credit market conditions. However, in certain 
speculative land markets, there is some evidence in support of the bubble 
hypothesis. 


Rational Explanations for Japan’s Land Prices 


The most expensive land in the world is in central Tokyo. In the Tsukamoto 
Sozan Building in Ginza 2-Chome, one square meter was priced at 37.7 million 
yen, or about U.S. $279,000 at the December 1990 exchange rate of about 135 
yen per U.S. dollar. The average lot for a U.S. house, at the 1990 price of 
$9000 per square meter in Japan, would cost about $9 million. Tokyo is 
especially crowded and congested and space is in great demand. For example, 
the fines for parking in a no-parking zone are as high as 200,000 yen (about 
$1667), which is about two weeks pay for an average worker. A cup of coffee at 
a major Tokyo Hotel can cost $4, with the high price based on the product, the 
service, and the high cost of the rent of the land. 

It’s clear that economic fundamentals can at least explain why Japanese 
land prices are higher than elsewhere. Japan’s population, at 120 million, is 
about half that of the United States. However, Japan’s area of 377,000 square 
kilometers is only about 4 percent of the United States, an area about the size of 
Montana. Moreover, the habitable area of Japan is only about 80,000 square 
kilometers, about 1/60 of the U.S. amount, an area about the size of South 
Carolina. Thus, in terms of density, Japan’s population per unit of habitable 
area is about 30 times that of the United States. Japan’s GDP per unit of 
habitable land is approximately 21 times as large. 

Moreover, the use of Japan’s habitable land is severely restricted by various 
regulations and tax laws. For example, the tax structure creates low costs for 
holding land and very high costs of selling land. Inherited land is valued well 
below the market price for tax purposes. Since other financial assets are valued 
at full market price, investors, seeking to lower ordinary and estate taxes, invest 
and hold land often with borrowed money. Existing land is underutilized 
because of regulations on zoning, height restrictions, and so on. Laws dating 
back to World War II, particularly the Building Lease Law, have made it 
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virtually impossible for a landlord to evict tenants even when the lease expires.’ 
As a result, rather than building rental property, a large number of land-owners 
hold vacant land or turn the land into parking lots while waiting for an 
attractive opportunity to sell.” Noguchi (1991) and Mera (1992) argue that it is 
land utilization, not the amount of land in Japan, that is crucial. 

Perhaps the most extreme illustration of these restrictions is Tokyo itself. 
Remember, Tokyo has the world’s highest land prices and it is an outlier in 
virtually all comparisons of value, be they price or rent. However, there is still 
considerable vacant and underutilized land in greater Tokyo. A survey by the 
Ministry of Construction concluded that about 160,000 acres were available for 
housing. This includes 89,000 in farmland, 56,800 in underutilized land (va- 
cant and parking lots) and 14,800 in vacated factory sites, publicly held idle 
land, and land of the now defunct National Railways. In addition, most builders 
do not utilize the legally allowed capacity. According to the Japan National 
Land Agency, as of 1986 the legally allowable floor-to-land-area ratio in Tokyo 
was 242 percent, however, the actual figure was 95 percent, for a usable ratio of 
39.3 percent. 

Boone and Sachs (1989) and particularly Boone (1989) have attempted to 
use facts like these to justify the high land values in Tokyo and the rest of Japan 
based on rational economic models. Their explanations fall into three cate- 
gories. First, Tokyo is an extreme outlier in prices and rents, so the high prices 
might be due to inefficiencies and excess concentration in that area. However, 
even without Tokyo, Japanese land prices greatly exceed those in the United 
States and Europe. Every prefecture in Japan has a measure of land value 
relative to GNP which is greater than any of the major industrialized countries. 

Second, distortions between the urban and rural sector, including tax 
policy and agricultural protection, also affect land prices by disfavoring urban 
land, and thus increasing its scarcity. Agricultural land is taxed less than a tenth 
as heavily as residential, commercial or industrial land. Ando and Auerbach 
(1990) have estimated that taxing agricultural land at the same rate as other 
land would increase the availability of residential land enough to reduce 
housing costs by 28 percent. This estimate seems high, since it seems to assume 
that freed-up rural land can be a perfect substitute for urban land. Boone 
(1989) took this approach further in two ways: he incorporated data on how 
rents decline as one moves away from a city center, and found this would lead 
to a 3 percent drop in aggregate land value. However, he also calculated that if 
agricultural policy were liberalized, a 9-12 percent drop in aggregate land 
values would result. Because these factors explain only a small proportion of 


"Legal changes in 1992 may make it easier to evict tenants. 
? Mera (1992) shows the optimality of this decision from an economic point of view. He also shows 
how this could drastically change with a substantial increase in land taxes. 
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the difference in land prices, Boone suggests that the major explanation for the 
high land values in Japan are macroeconomic factors. 

In a series of models, Boone is able to rationalize the high land prices. The 
long-run determinants of land values (measured relative to GNP) are the share 
of rents in an economy-wide aggregate production function and the capitaliza- 
tion rate for future rents. In turn, rents depend on productivity growth in the 
economy, high utilization of land, and low property taxes (to keep the rate of 
return high). Future rents depend on these factors as well, discounted with the 
appropriate interest rate. Several models, along the lines of Cass (1972) and 
Solow (1973), all predict that economies with some combination of low rates of 
time preference, high productivity growth rates, and low property taxes 
(through their effect on the required return for land) would tend to have high 
measures of land value relative to GNP in the steady state, high growth rates 
and high savings rates. 

From the standpoint of models like these, Japan seems to have all the 
ingredients for high land values. Average property taxes are low: the average 
rate was just 0.39 percent in Japan in 1988, less than one-fourth the 1.73 
percent property tax rate in the United States. So it has been easy to hold land 
with low taxes and low costs of borrowing and interest charges that are tax 
deductible. The intensity of land use in Japan is 20 to 30 times that in the 
United States. Real growth rates in Japan have exceeded those in the rest of the 
world over the last few decades, and estimates of the relative growth rates by 
Data Resources in 1989 were 4.0 percent for Japan and 2.3 percent for the 
United States for the next decade. Finally, Japan has had a low rate of time 
preference, perhaps one-quarter to one-third the U.S. levels, with high savings 
despite low interest rates. 

In 1985, Japan’s land value to GNP compared to U.S. land value to GNP 
was in a ratio of 2.88: 1. Given the much smaller land area of Japan, this figure 
implies Japanese land prices in the range of 80-120 times U.S. land prices. 
From 1985 to 1987, the ratio of Japan’s land value to GNP, compared with U.S. 
land value to GNP, increased to 4.3: 1. Boone (1989) calculates that the 1985 
ratio is plausible enough, based on the underlying assumptions. The jump to 
1987 requires that Japan’s growth rate be perceived to increase about 2 percent 
faster, forever. Thus, Boone has argued that as ridiculous on the surface as the 
Japanese land prices are, they are more or less in the ballpark of a rational 
economic explanation, provided that Japanese required rates of return remain 
low and real growth remains high.* This underscores the crucial importance of 


“along with Boone (1989), Noguchi (1991), Rose (1990) and Ziemba (1991a) have estimated land 
price equations based on fundamental data. Using data on 47 prefectures from 1977-87, Noguchi 
(1991) found that the log of price in 100 yen per square meter was negatively related to the 
long-term interest rate, and positively related to the real prefecture GDP per unit of urban land, 
the population growth rate within that prefecture and neighboring prefectures, and the share of 
secondary and tertiary industries in the prefecture. Rose (1990) shows that land values and housing 
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the interest rate used by the market, which will become even more apparent in 
discussing the reason for the decline in asset prices from 1990-92. 

In short, Japanese land prices are largely explainable through standard 
economic variables. Moreover, if the government in Japan wished to reduce 
land prices, it has at its disposal many tax and regulatory instruments which 
could do so. Unless such steps are taken, however, Japan’s land prices will 
probably remain extremely high and unaffordable for most of its citizens. 
Current Japanese government economic policy is to reduce the price of the 
average accommodation to five times the average annual salary. The 1990-92 
decline has so far reduced this ratio from about nine to about seven. 


The Relationship between Land and Stock Prices 


Land and stock prices have been in a very close, positive relationship since 
1955. In fact, Figure 1 shows that land and stock rose almost the same amount 
from March 1955 to September 1992. Calculations in Ziemba (199 1a) show that 
the stock market level is most closely related to commercial land prices and the 
correlation in these biannual series up to 1988 was over 99 percent. In the 
period 1955-71, land prices increased more than stock prices, with land 
increasing 100-175 times in the six largest cities and stocks increasing about 
109 times. From 1971-89, the stock market increased about 20 times while the 
various types of land only increased about five to ten times in the six largest 
cities, and three to four times in the country as a whole. 

However, stocks have been much more volatile than land prices. For 
example, the 5 to 8 percent decline in various land prices in 1973-74, following 
the first oil crisis, is the only decline in those series before the early 1990s. 
Stocks, on the other hand, have had 29 declines of 10 percent or more from 
1949 to 1992. Figure 2 illustrates the differences in stock and land volatility, 
which is dramatic even with biannual data. 

Since many economists have little familiarity with Japanese statistics on 
land and stock prices, a few words describing the data seem appropriate. Data 


rents are positively related to population and/or per capita income, inversely related to the interest 
rate, inversely related to the supply of land and inversely related to distance from the city center. 
Rates of growth in income or population generally do not contribute significantly to the explana- 
tion of prices. Per capita real income or population, and the real interest rate or the inflation rate 
exert some influence over land value in the directions implied by theory. 

Interestingly, Rose also found, using data from 1954 to 1987, that the real interest rate (the 
“all banks average agreed interest rate on loans and discounts” net of the rate of increase in the 
CPI) and the general price inflation have a correlation of nearly — 1 (during 1969 to 1984 it was 
— 0.99) so these variables are essentially substitutes. Part of the reason for this is that Japanese 
inflation is highly dependent upon the yen-U.S. dollar exchange rate which in turn is related to the 
interest differential. 
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Figure 1 
All Land prices and the Nikkei Stock Average, 1955:1-1992:2 
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on land prices is available from the Japan Real Estate Institute on indices of 
land prices for industrial, residential, commercial, the highest-priced lots, and 
all land in the six largest cities and throughout the country from 1955 to the 
present.’ In addition, the Japan National Land Agency provides appraisal 
information on several thousand properties on an annual basis. 

The data are appraisal-based. Simple averages of samples of three sets of 
ten lots in each city form the indices. The sampling procedure separates land 
into high, medium and low grades reflecting location, social circumstances, 
yield, and so on. Lots are randomly selected from these three classes. The 
procedure produces useful but not ideal data for analysis because the indices 
tend to be lagged (largely based on dated information) and smoothed (the data 
tend to be averaged and when prices fall there is a tendency not to sell 
properties). Glaringly apparent in Figure 1 is the lagged price declines of land 
in 1992. Gyourko and Keim (1992) discuss some of the problems associated 
with real estate data. Data are available for the end of the fiscal year (March 31) 
and the half fiscal year (September 30). 

Daily transaction data of a price-weighted index of 225 large capitalized 
stocks called the Nikkei Stock Average (NSA) is available beginning when the 
stock market in Japan reopened in May 1949, following the occupation after 
World War II. This index is comparable to the Dow Jones Industrial average in 


4 The six largest cities are Tokyo, Yokohama, Nagoya, Kyoto, Osaka and Kobe. The country-wide 
indices are based on 140 cities and recent data on 225 cities. 
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Figure 2 
Rates of Return All Land and the Nikkei Stock Average, 1955:1-1992:2 
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Source: Stone and Ziemba (1992) 


its construction, with the same wide acceptance and problems. A more repre- 
sentative index of the market is the Topix, which is the value-weighted index of 
all Tokyo Stock Exchange “first section” securities, some 1225 as of August 
1992. The “first section” amounts to about 85 percent of the trading and 
market capitalization in Japan. Daily values of this index are available since its 
use began in 1968, and earlier values can be calculated from available data. The 
Topix is comparable to the S&P 500.° The NSA and Topix are highly intercor- 
related and they are both used in land and stock price relationship studies. The 
Topix is more highly capitalized because it is value-weighted, even though the 
NSA has the largest individual capitalized stocks. The Topix is also much more 
highly concentrated in banking stocks, who are the owners of most of the 
land-related debt in Japan. 

Stock prices of individual securities and land are often intertwined in the 
Japanese market. Cutts (1990) discusses the Japanese policy of borrowing on 
stocks to buy land, and the reverse. The major purchases of land during the 
1970s and 1980s were by corporations; over this time, the household sector has 
been a net seller to the non-financial corporations and other sectors. The 
corporations have had the resources to purchase land, some of which are used 
to house their employees with subsidized rent. The net purchases of land by the 
non-financial sector was 28 trillion yen during 1985-89 versus 3 trillion yen in 


®' There is extensive futures trading including substantial index arbitrage in the NSA in Singapore, 
Osaka and Chicago. In fact, the NSA futures contract has higher dollar volume than any other 
index futures contract, including the S & P 500. 
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the previous five years. The extra 25 trillion yen was about 4.4 percent of their 
567 trillion yen in holdings at the end of 1989. 

Ziemba and Schwartz (1991) discuss examples of the land component of 
major corporations including adjustments to price earnings ratios that separate 
out the hidden assets (land and cross-held stock) from the firm’s earnings. If 
one does the latter, using appropriate indices for valuation, the price-earnings 
ratios of many stocks fall dramatically. For example, Nippon Steel, Japan’s 
largest stecl company, had a price-earnings ratio of 101.5 in 1988. Without 
land it was 83.3, without buildings it was 74.3 and only 5.3 without cross 
holdings.® 

Do Japanese stock and land prices literally move together? Or is one 
leading the other? There is strong evidence from a variety of sources that stock 
price changes lead land price changes. For example, Stone and Ziemba (1990) 
found strong evidence that stock price returns led land price returns from 1972 
to 1987, rather than the reverse.” Canaway (1990) estimated the lag to be 
eleven months, which is consistent with the general character of Stone and 
Ziemba’s (1990) calculations. When we updated the calculations in Stone and 
Ziemba (1990) for our 1992 paper, including data for the period 1987 to 1992, 
there was little change in the values, and the conclusion that stock returns lead 
land returns is maintained. Hamao and Hoshi (1991) also find that excess 
returns of stocks are useful in predicting excess land returns, while excess land 
returns are not useful in predicting stock returns.” 

This sort of strong connection between stock and land prices does not seem 
to occur everywhere; for example, investigation of data for the United States 


ĉA simple model to value a Japanese corporation is to aggregate the land, securities and other assets 
net of debt, using appropriate discount factors. While inventories, receivables, bonds and equity are 
valued at close to market interest rates, the land components held directly or indirectly are priced 
by the market at 10-20 percent of their current market price and evaluated in financial statements 
at extremely low book values. The latter are often one-thirtieth of current market value. The 
market is able to discount large land holdings because it is virtually impossible to initiate a hostile 
takeover of a Japanese company (Zicmba and Schwartz, 1992). 

‘Stone and Ziemba (1990) estimated econometric models that use past quarterly stock price returns 
(measured by the Topix index) in an attempt to predict the current quarter’s overall land price 
returns, including a dynamic regression model with autoregressive terms, a Box-Jenkins modcl, 
and an autogressive conditional heteroscedastic model. The estimation period was 1972:2 to 1989:1 
and the forecast period was 1989:2 to 1990:3. During the estimation period, the models all fit well 
with coefficients having the expected signs. However, only the dynamic regression model predicted 
well out of sample. Stone and Ziemba (1992) reestimated a similar equation for the NSA, rather 
than the Topix index, to include 1990-91 data. Again, the equation predicted well in the forecast 
period which suggests that the past relationship of lagged stock market returns with current land 
returns is still valid. Based on these equations, the recent decline in stock prices should imply a 
drop of about 20 percent in Japan’s land prices during 1992 and 1993. 

5tamao and Hoshi (1991) found that land index returns and the Nikkei 225 price returns are not 
co-integrated. Two variables are co-integrated if they move together in the long run. Using data 
through 1992, Stone and Ziemba (1992) also found this lack of co-integration. Recent research 
indicates that tests of co-integration may not be very powerful (Hakkio and Rush, 1991). Further- 
more, while land and stocks have moved together in the past, as shown in Figure 1, they have 
diverged significantly at various umes, including the recent past. 
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found no significant relationship between stock and land price returns (Stone 
and Ziemba, 1990; 1992). However, there are significant relationships between 
lagged real estate stock returns and real estate returns in the United States 
(Gyourko and Keim, 1992). But before discussing possible reasons for this 
connection in Japan, we should first turn to an issue that may have occurred to 
some readers—the problem of speculative land indices. 


What About Speculative Land Prices? 


While stock prices may generally lead land prices, it has sometimes been 
argued that one class of land—speculative land—leads stock prices. Speculative 
land is the term given to land which is held for expectation of gain or as a 
hedge against inflation; it is contrasted with land that is an essential input for 
business operations or housing. In Japan, speculative land is typically highly 
levered, while essential use land is not levered. As one example, in early 1988 
The Economist magazine hypothesized that golf course membership prices gen- 
erally lead the stock market, based largely on the fact that golf courses fell in 
the quarter prior to the October 1987 stock market crash. 

Golf course membership indices for various cities and areas of Japan, 
which are compiled by the Nihon Keizai Shimbun, Inc. (Nikkei), provide one 
measure of speculative land values. The value of all golf courses in Japan at the 
end of 1989 was about $500 billion, a value double the Australian or Swiss stock 
exchanges. There is a strong correlation between golf course membership 
prices and other speculative land prices, such as condominiums, but better data 
is available on golf course memberships. These indices, computed weekly since 
the end of 1981, are based on actual sales of memberships at the more than 400 
golf courses in Japan. The index is updated at the end of each week by Nikkei 
from interviews with the six major golf membership dealers who deal in the top 
400 clubs. The brokers supply the current bid and ask prices of each club and 
the index is the arithmetic average of these.” Membership in a golf course in 
Japan allows play by the member at nominal fees and the ability to bring paying 
guests as well as a share in the course including the land. Much social and 
business activity is conducted at golf courses where the ability to be in a less 
crowded environment is highly valued. 

The golf course membership index in Tokyo and the stock market gener- 
ally move in tandem, since both of these indices reflect economic activity which 
is centered in Tokyo. However, there have been two recent cases where golf 
course membership prices rose far above the NSA, before the gap was closed. 
As shown in Figure 3, the first occurred in 1986-88, thus occurring both before 
and after the October 1987 stock market crash. In 1986, golf course prices 
moved above stock prices, which in turn moved above land prices. Golf course 


9 i 3 ae Se eit, : . he St 
When there are no bid or ask prices, statistical substitution is used to adjust the index to maintain 
continuity. 
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Figure 3 
Quarterly Tokyo Golf Course Membership Prices and the Nikkei Stock 
Average, 1982-1992. 


NSA led Golf 
600+ Mar 90 fall ~ 
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Source: Stone and Ziemba (1992) 


membership prices started falling in the quarter prior to the October 1987 
world-wide stock market crash and by early 1989 stocks and golf course prices 
were in balance again. Then in January 1990, stocks began to fall in reaction to 
higher interest rates, but golf memberships and small capitalized stocks did not 
fall until March 1990 and all land in September 1990. The prices in late 1992 of 
both the NSA and Tokyo golf courses seem to be consistent with Japan’s 
economic growth in the 1980s with the excesses of the 1985 to 1989 period 
eliminated. 

Based on these episodes, it is difficult to conclude that prices for golf 
memberships, or speculative land more generally, lead stock prices. Stone and 
Ziemba (1990, 1992) offer formal confirmation of this result. However, they do 
find evidence that golf courses lead land price returns more generally, and that 
stock price returns lead land price returns, even when land is defined to 
include golf course membership returns. The evidence is that a move in stock 
prices impacts golf course prices in about three months and land in nine to 
twelve months. 


Has Japan Experienced a Speculative Bubble 
in Land and Stock Prices? 


Japanese stock prices fell over 60 percent from their peak at the end of 
1989 to their valley in August 1992. Meanwhile, speculative land prices (as 
measured by the golf membership index) fell over 50 percent by August 1992 
and by over 60 percent in early 1993. While these declines are enormous, they 
don’t quite measure up to some of the other spectacular collapses in financial 
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market history. For example, the U.S. stock market in the Great Depression 
declined by 87 percent; the Hong Kong stock market fell by 92 percent in the 
early 1970s; the Mexican stock market had fallen by 78 percent in 1981; and 
Taiwanese stocks had fallen by 80 percent in 1990. To take an even older 
example, the “tulipmania” experience in Holland led to a 93 percent decline in 
value in 1637. 

Despite the dramatic declines in stock prices and speculative land, the fall 
in essential land prices from 1990 to 1992 has been only about 15~20 percent. 
This fall is nonetheless striking, because Japanese land index values had not 
previously fallen in the postwar period, except in the aftermath of the 1973-74 
oil crisis. The latter decline was very small, despite a major restructuring of the 
Japanese economy and a 37.4 percent drop in the NSA. Prices began rising 
again after six months. Japanese land did not decline in the second oil crisis, 
which did not have the shock impact of the first oil crisis. Moreover, remember- 
ing that land and stock prices have tended to move together, with stock prices a 
bit in front, it would seem that land prices remain extremely high. 

In the end, can the level of Japanese land and stock markets, or the sharp 
movements of those markets in the 1980s, be explained by fundamental factors, 
like interest rate movements? Were they explainable for a time, and then no 
longer explainable? Let us first state the case for explaining these price 
movements with fundamental factors, and then the case for believing that some 
sort of speculative bubble occurred. 


Evidence on Fundamental Value 

Japanese land prices were often 100 times greater than those in the United 
States. Differences of that magnitude are consistent with economic rationality, 
assuming that the intensity of land use in Japan is 20-30 times higher and the 
required rate of return on rents is about one-fourth to one-third as high. Both 
of those conditions were consistent with the data up to 1989. Boone’s (1989) 
analysis is also consistent with the 1990-92 decline with an upward adjustment 
of the required rate of return at a time of increased interest rates. Liu and Mei 
(1991) also find support that stocks and real estate prices are based on 
fundamental values. Ziemba and Schwartz (1991) and Ziemba (1991b, 1993) 
find that Japanese stock prices react to similar fundamental and seasonal 
factors as U.S. stocks, with earnings growth the most important variable for 
stock price changes. Gampbell and Hamao (1992) show that the dividend price 
ratio and interest rate variables are related to excess returns in Japan. 

Not only are the levels of asset prices consistent with economic rationality, 
but the boom and bust cycle of the 1980s and 1990s can also be explained by 
rational factors—primarily movements in the short and long-term interest 
rates. Figure 4 shows the short-term interest rates in Japan from June 1984 to 
August 1992. The dramatic decline in rates in 1985-86 following the Plaza 
accord in September 1985 in the wake of the substantial appreciation of the yen 
and the expansion of the monetary supply coincided with the more than 
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Figure 4 
Short-Term Interest Rates in Japan, June 18, 1984—August 12, 1992 
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doubling of land prices and the sharp rise in stock prices from 1985 to 1987. 
Interest rates then stayed low from 1987 to early 1989. Then the Bank of 
Japan’s tight money policy in 1989 and 1990 led to successive increases in the 
discount rate from 2.5 percent to 6.0 percent, which presaged the decline in 
stock and speculative land prices. Even cheaper money was available for large 
corporations in the late 1980s through equity warrant bonds, which are traded 
mainly in London.'® The 1980s were a period of cheap and easily available 
money. In contrast, the 1990s have been a period of expensive and hard to 


With currency hedging, the bonds with these detachable warrants had net cost of borrowing of 
—2 percent to +2 percent. The basic warrant bond was an ingenious product of the 1980s. A 
three-year bond with a detachable warrant provides an interest rate of, say, 4 percent versus the 
market rate of 8 percent in U.S. funds because of the value of the warrant. Currency hedging the 
bonds proceeds in dollars back into yen with a yen interest rate of (say) 3 percent below that of the 
dollar provides a net cost of 1 percent for the money in yen because the future yen value must be at 
about a 3 percent premium per year to the U.S. dollar to avoid riskless arbitrage. In addition, when 
the warrants are exercised, the firm receives a substantial cash infusion in exchange for a slight 
dilution of the equity. Higher yen interest rates versus the dollar and a weak stock market (no 
longer receptive of additional supply such as these warrants) essentially halted this market in 1990. 
Recent issues are largely in marks or Swiss francs which have high interest rates. Kuwahara and 
Marsh (1992) discuss this market and the pricing of these warrants. There are some $100 billion 
plus of these warrants that will expire worthless if the stock market does not rise considerably in 
1993-94. 
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arrange financing. The low short-term interest rates shown in Figure 4 do not 
reflect the true cost of most borrowings in 1992. For example, ten-year 
government bonds whose rates were slightly higher than the short-term rates 
in 1987-89 rose to a peak in August 1990 slightly below the short-term rates 
and then declined in 1992, but at a slower rate than the short-term rates. In 
late 1992, these rates were about 150 basis points higher than the short-term 
rates. 

An alternative way to test whether fundamental values can explain stock 
prices is to look at price /earnings ratios. While the NSA was trading in the late 
1980s at P/E ratios significantly higher than those in the United States, much 
of the difference can be explained by alternative accounting, timing and related 
institutional factors, as shown by Aron (1981, 1989) and French and Poterba 
(1991). Paul Aron (1989) has computed adjusted Japanese price-earnings ratios 
on a continuing basis since 1981. He found that the Japanese P/E ratios have 
been roughly comparable to those in the United States in the 1980s, when one 
adjusts for the differing required rates of return in the two countries. French 
and Poterba (1991) arrive at a similar conclusion, using a somewhat different 
approach.''! However, since calculations like these were completed, interest 
rates have declined substantially in the U.S. and the U.S. stock market has 
risen. Short-term interest rates in Japan in August 1992 were at levels as low as 
in 1986. It is no longer clear that price-earnings ratios continue to reflect only 
the difference in required rates of return between the U.S. and Japan. 

Calculations in Ziemba and Schwartz (1991) argue that once the required 
rates of return in the two countries are adjusted for current interest rates, the 
market in December 1989 was greatly overpriced. This model also accurately 
reflects the rises and declines in Japanese stocks during 1990-91. However, it 
may be that investors use lagged- or levered-interest rates in their calculations, 
rather than current rates, or that investors are not fully rational and partially 
base their decisions on investor sentiment, as in the noisy trader hypothesis of 
Shleifer and Summers (1990). That point of view is consistent with the finding 
that overpriced stocks were followed immediately by two sharp declines during 
1990. Further evidence in support of this view was the continual decline in the 
equity indices in 1991-92 while interest rates were in a substantial decline. 


Speculative Bubble Evidence 

How does one test for a speculative bubble, or even think about such an 
event? One common way of making the distinction is to say that if price levels 
and movements can be explained by fundamental factors, then no bubble 
exists. Conversely, following Stiglitz (1990), we assume that a bubble exists if 
the reason that the price is high today is only because investors believe that the 
selling price will be higher tomorrow when fundamental factors do not seem to 


"French and Poterba (1991) focus on the 1986 doubling of price-earnings ratios. ‘They argue that 
accounting differences can explain about half the long-run differences in United States and 
Japanese price earnings ratios but not the 1986 doubling. 
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justify such a price. While in principle straightforward, it is difficult in practice 
to separate the bubble from the changes in fundamentals in a particular asset 
market. An examination of econometric work concerned with bubbles versus 
fundamentals by Flood and Hodrick (1990) concluded that no study has yet 
managed to solve the problem of separating the bubble movements from the 
possibility that the underlying fundamental problem is misspecified. 

With these warnings in mind, Ueda (1990) argued that stock prices under- 
esumated the value of the corporate assets during 1970-83, but overestimated 
the value of these assets since 1983. The extreme increase in stock prices in the 
1980s occurred during a period of declining required risk premiums in relation 
to rates of return. Hence, the sharp increase in prices of stocks prior to their 
steep decline in 1990 was at least partially based on expectation of future 
increases in asset prices of land and stock shares held. French and Poterba 
(1991) present similar analyses and conclusions. 

Japanese land in 1991 was worth about $20 trillion dollars, over 20 percent 
of the world’s wealth. From the mid-1950s to the mid-1980s, Japanese land 
values increased 11 times as much as rents. Rose (1990) also found that land 
prices increased more rapidly than income, whereas housing rents increased 
less rapidly then income in this period. Noguchi (1991) found that land prices 
are far in excess of the discounted sum of rents and pricing equations. 

An alternative approach which lends some insight is to investigate the 
distribution of prices. When Rachev and Ziemba (1992) did this for 
the distribution of the golf course membership prices, for example, the result- 
ing distribution had very fat tails compared to typical distributions for U.S. 
stock prices (Fama and Roll, 1971; Akgiray and Booth, 1988) and Japanese 
stock prices and was not normally distributed.'!? The continual decline of golf 
course prices during 199] and early 1992 had a marginal effect on the 
parameter estimates. The fat tails means that the probability in any short time 
interval of a large increase or decrease is very high. 

We have no evidence that recent Japanese stock prices have distributions 
that are consistent with the bubble hypothesis. However, this has not been 
adequately studied; one study we are aware of is Tse (1991), who used data 
prior to the 1990-92 stock market decline and excluding the October 1987 
crash, and found no evidence of a bubble. Rachev and Ziemba (1992) using 
data from 1983~92 found Japanese stock prices do fit a stable distribution well 
and have tails with mass intermediate between U.S. stocks and Japanese golf 
courses. But perhaps this uncertainty about a bubble in the stock market is to 
be expected, since current studies are not even able to agree whether an event 
like the October 1987 crash of the U.S. stock market was a speculative bubble. 
To cite a few examples, Froot and Obstfeld (1991), Hardouvelis (1988), and 
Miller (1990) argue that there was a speculative bubble, while Dezhbakhsh and 
Demirguc-Kunt (1990) argue that prices followed a random walk based on 
current information. 


'2Mittnik and Rachev (1993) survey the modeling of asset returns with stable distributions. 
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Overall, we would say that there is some evidence for a bubble in specula- 
tive land prices, as measured by the golf course membership index. Remember, 
purchases of such speculative land were highly levered. On the other side, the 
distribution of price changes seems to offer little evidence (at least so far) for a 
bubble in the market for essential land or the stock market. 


Final Remarks 


The decline in Japanese financial markets has put intense financial pres- 
sure on the banks and other financial institutions. In particular, the banks must 
satisfy the Bank of International Settlements regulations in 1993 (Shibota, 
1991). This caused the government to provide a major fiscal policy stimulus in 
August 1992. This, along with very low interest rates, would seem to presage a 
change in investor sentiment in Japan. The economic evidence seems to point 
to lower land-prices in the near-term—about 20 percent according to the 
regression models—but perhaps some recovery in the stock market (whose 
market capitalization has fallen well below its worldwide share of economic 
activity), thus gradually bringing the value of these two assets back toward their 
historical congruence. 

As to the question of whether a speculative bubble actually occurred or not, 
the answer seems to involve rather subtle matters of definition. Low interest 
rates in the mid-1980s, combined with the interrelationship between stock and 
land markets, clearly helped to trigger a boom. However, it appears likely that 
the boom went somewhat beyond what could be justified based on fundamental 
factors. On the downside, then, Japanese government policy formulated in 
1988 and implemented beginning in 1989 was to deflate the so-called “bubble 
economy” through much higher interest rates (Flack, 1990; Shale, 1991). These 
rate increases triggered the initial decline in asset prices. But as more and more 
selling occurred, there was a strong multiplier effect with individuals and 
corporations that were highly levered in stocks and land being forced into more 
financial trouble because of the decline in the market thus leading to their 
forced selling. This effect was strong enough that even though short-term 
interest rates had returned by 1992 to a level lower than those at any time in 
the mid to late 1980s, and long-term rates were as low as they had previously 
been, asset prices remained low. A factor in this is the availability of credit. It is 
simply much more difficult in 1992-93 to arrange financing than it was in the 
late 1980s. 

On one hand, it would seem something of an artificial distinction to say 
that movements in interest rates are a fundamental factor, but the cycle of 
leverage these movements unleashed are a speculative bubble. On the other 
hand, if the simple definition of a bubble is when investors are buying and 
selling only on the basis that prices will rise further, than surely using the high 
price of previously purchased assets as collateral for loans to buy still more 
assets should qualify as a bubble. 
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Chapter 10 


THE CHICKEN OR THE EGG: 
LAND AND STOCK PRICES IN JAPAN" 


William T. Ziemba 
Faculty of Commerce, University of British Columbia, Vancouver, B.C. V6T 1Y8 


Tsukamoto Sozan Building in Ginza 2-Chome in central Tokyo is built on 
the most expensive land in the country with one square meter priced at 
¥37.7 million or about $279,000 U.S. at the (December 1990) exchange rate 
of about ¥135 per U.S. dollar. 


Abstract 


It is well known that land prices in Japan are the world's highest and that the 
stock market has increased dramatically. This paper explores the relationship 
between the price levels of these two markets. The results indicate an 
extraordinarily close relationship, particularly for commercial land. While the 
prices of land and stocks are highly related, the evidence is that stock prices lead 
land prices and not the reverse. 


Land Prices in Japan are Astronomical 


Some 120 million people live in Japan in an area of about 377,800 km? which 
is about the size of Montana. Most of the land is mountainous or is used for 
agriculture. Indeed, less than 5% of the land is used to house all the people. A 
breakdown of land use appears in Table 1. Hence, with high incomes, crowded 
conditions, and an intense desire to invest at home, land prices in the most 
desirable locations have escalated beyond belief. Almost 30 million people, or a 
quarter of the population, lives in the greater Tokyo area. Much of the housing 
in Japan's major cities is owned by the large corporations and their employees 
receive subsidized rent. However, fully 60% of families in Japan, and 55% in 


* Portions of this research were conducted under William T. Ziemba’s direction at the Yamaichi 
Research Institute in Tokyo in 1988/89. Without implicating them I would like to thank my 
colleagues there, particularly Hirokazu Yuihama and Hitoshi Ishiyama for their help. Thanks 
are also duc to Mr. Motohiko Higashikawa of the Japan Real Estate Institute for supplying data to 
me. This research was also partially supported by the Centre for International Business Studics, 
Univeristy of British Columbia and the Social Sciences and Humanities Research Council of 
Canada. 
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Tokyo, own their own homes and many individuals invest in raw land or in 
apartments that they rent to others. 


Table 1:_Land Use in Japan (1986) 
Agriculture 14.5 


Woodlands 66.9 


Moors 0.8 
Rivers 3.5 
Roads 2.9 
Dwellings 4.0 
Other 7.4 
Total 100.0 


Total Area 377.8 (1000 km?) 
Source: National Land Agency, Japan 


Table 2 and figure 1 give the Japan Real Estate Institute's land indices for the 
six largest cities, and for all of Japan for commercial, housing, industrial and total 
land for each six month period from March 1949 to September 1990. The indices 
are for March (1) and September (2). Figure 1 also gives the yearly rate of changes 
as of March of each year. The six largest cities are Tokyo, Osaka, Nagoya, 
Yokohama, Kobe and Kyoto. The country wide indices are based on 140 cities. 
The data are appraisal based which tends to smooth the price levels and lag the 
market. Simple averages of samples of ten lots in each city form the indices 
which were normalized at 100 as of March 31, 1980. The sampling procedure 
separates land into high, medium and low grades reflecting location, social 
circumstances, yield, etc. The sampling procedure selects lots randomly and 
equally from each of these three classes. 

Table 2 also indicates that the price increase has been largest in the six largest 
cities. Despite large recent rises, the relative gain in the period 1955 to 1970 was 
much larger than from 1970 to 1990. For land in the whole country, the 1955 to 
1970 period produced gains of about 15 times 1955 values. These prices then 
increased only about four fold in the ensuing twenty years. In the six largest 
cities, the increase was also much larger in the 1955 to 1970 period versus the past 
two decades. 
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Table 2: Land Indices in Japan 1955-1990 


Nationwide Six Largest Cities Consumer 
Al Com- Housing Indus- All Com- Housing Indus- Price 
mercial trial mercial trial Index 
1955:1 3:1 3.7 2.5 3:2 2.4 4.9 1.7 22 
1955:2 3.3 4.0 2.7 3.4 2.6 5.1 1.9 2.3 
1956:1 3.5 4.3 2.9 3.7 2.8 5.6 1.9 2.6 
1956:2 3.9 4.8 3.2 4.1 3.2 6.1 2:2 3.0 
1957:1 4.5 5.5 3.6 4.8 3.6 6.7 2.6 3.5 
1957:2 5.0 6.1 4.0 5.4 4.2 7.4 2.9 4.1 
1958:1 5.5 6.6 45 5.9 4.6 7.8 3.3 4.7 
1958:2 6.1 7.2 5.0 6.5 5.0 7.9 3.6 5.3 
1959:1 6.8 8.1 5.5 7.3 5.5 8.4 4.0 5.9 
1959:2 7.7 9.1 6.2 8.3 6.3 9.6 4.6 6.8 
1960:1 8.7 10.5 6.8 9.5 7.2 11.2 5.2 7.9 
1960:2 10.2 12.4 7.8 11.4 9.3 14.1 6.0 10.6 
1961:1 12:3 14.4 9.3 14.6 12.1 18.0 7.5 14.8 
1961:2 14.5 16.7 10.8 17.4 15.7 22.9 9.5 20.0 
1962:1 15.7 17.9 11.8 19.1 17.3 24.3 10.5 22.2 
1962:2 17.1 19.2 12.8 21.0 19.0 25.4 11.9 24.2 
1963:1 18.4 21.0 13.6 22.7 20.5 27.1 13.1 26.1 
1963.2 19.6 22.1 14.5 24.5 22.3 29.0 14.5 28.4 
1964:1 21.0 23.5 15.5 26.3 24.1 31.1 15.9 30.4 
1964:2 22.5 25.4 16.6 28.0 25.6 32.9 17.1 32.0 
1965:1 23.8 26.6 17.8 29.6 26.4 33.8 17.8 33.1 
1965:2 24.4 27.5 18.2 30.1 26.6 34.1 18.0 33.2 
1966:1 25.0 28.4 18.9 30.5 26.9 34.5 18.4 33.2 
1966:2 25.8 29.5 19.6 30.8 27.2 34.8 18.9 33.2 
1967:1 27.1 31.4 20.8 31.9 28.1 36.3 19.6 34.0 
1967:2 28.8 33.3 22.4 33.2 29.2 37.6 20.7 35.0 
1968:1 30.8 35.7 24.2 35.1 30.5 38.8 22.0 36.3 
1968:2 33.2 38.6 26.3 37.4 32.4 41.1 23.9 38.1 
1969:1 36.1 41.8 29.0 40.2 35.1 44.4 26.1 40.9 
1969:2 39.6 46.1 32.2 43.2 38.1 48.0 28.7 44.1 
1970:1 43.2 49.9 35.5 47.0 41.3 51.4 31.3 47.8 
1970:2 46.8 53.8 38.7 50.7 44.8 54.1 34.4 52.0 
1971:1 50.0 57.0 41.8 54.1 48.0 56.5 37.2 55.8 
1971:2 53.2 59.8 44.9 57.8 51.0 58.8 40.0 59.4 
1972:1 56.5 63.2 47.8 61.8 54.1 61.7 42.8 62.7 
1972:2 61.2 67.8 52.1 66.7 59.6 66.7 48.1 68.6 
1973:1 70.8 76.6 61.6 77.0 71.2 77.0 59.2 80.8 
1973:2 81.1 85.9 71.8 88.5 81.1 85.8 68.5 91:7 
1974:1 87.0 91.4 77.8 94.8 84.1 88.7 71.0 95.3 
1974:2 88.2 92.4 79.1 95.7 84.5 89.1 71.5 95.4 
1975:1 83.3 87.8 74.6 89.7 77.3 82.1 65.6 86.7 100.0 
1975:2 83.5 88.0 75.0 89.7 77.5 82.1 66.2 86.8 
1976:1 83.9 88.3 75.7 89.9 78.0 82.4 66.9 86.8 108.3 
1976:2 84.7 88.7 77.0 90.2 78.8 82.8 68.3 87.1 
1977:1 85.7 89.3 78.7 90.7 79.9 83.9 69.9 87.4 118.1 
1977:2 86.9 90.0 80.5 91.1 80.9 84.5 71.6 87.8 
1978:1 88.1 91.0 82.5 91.8 82.2 85.7 73.6 88.4 112.6 
1978:2 89.8 92.2 85.1 92.9 84.5 87.4 77.2 89.7 
1979:1 92.1 93.9 88.7 94.4 88.3 90.3 82.9 92.0 127.0 


1979:2 95.5 96.5 93.7 96.8 94.2 95.3 91.6 96.0 
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Table 2. Land Indices in Japan 1955-1990 (continued) 


Nationwide Six Largest Cities 
Consumer 
All Com- Housing Indus- All Com- Housing Indus- Price 
mercial trial mercial trial Index 


1980:1 100.0 100.0 100.0 100.0 100.0 1000 100.0 100.0 137.2 
1980:2 104.6 103.7 1065 103.5 1046 1043 106.2 103.4 
1981:1 108.7 106.9 112.2 106.7 108.5 108.1 110.6 106.7 143.7 
1981:2 112.8 110.3 117.6 109.9 1124 1125 114.8 109.9 
1982:1 1164 1134 1224 1126 115.7 116.7 117.9 112.6 147.6 
1982:2 119.4 1160 1264 115.0 118.5 1206 120.2 114.8 
1983:1 121.9 1183 1295 117.0 121.3 124.6 1225 116.7 150.5 
1983:2 124.0 120.2 131.9 118.9 123.9 1285 1246 118.6 
1984:1 125.8 122.0 134.1 1203 127.6 135.7 1268 1206 153.8 
1984:2 127.6 123.8 1360 121.8 132.2 1444 1296 123.1 
1985:1 129.3 125.7 137.7 123.2 137.1 153.6 133.8 1250 157.0 
1985:2 131.0 127.8 139.2 124.5 143.3 167.1 138.1 127.1 
1986:1 133.0 130.9 140.7 125.8 156.6 197.9 146.7 131.2 158.1 
1986:2 135.6 134.9 1427 1273 173.8 2293 165.3 137.0 
198721 140.2 141.1 147.0 130.1 197.2 264.7 1863 1536 158.3 
1987:2 149.9 153.3 156.1 137.0 234.8 336.7 216.1 175.4 
1988:1 154.2 159.9 159.3 140.1 252.2 375.2 2294 183.2 159.3 
1988:2 159.2 166.9 162.7 144.7 279.9 420.0 240.0 213.1 
1989:1 165.9 175.9 168.1 150.1 318.8 467.5 264.5 243.7 163.4 
1989:2 174.8 1868 176.0 157.7 356.6 528.8 300.7 279.4 
1990:1 189.3 203.9 189.6 170.2 408.2 599.2 253.0 315.7 
1990:2 203.1 220.55 202.9 1814 429.1 625.9 372.4 331.7 


# times increase in ¥ 

1955:1 to 1990:2 65.5 59.6 81.2 56.7 178.8 127.7 219.1 150.8 
1955:1 to 1970:2 15.1 14.5 15.5 15.8 18.7 11.0 20.2 23.6 
1970:2 to 1990:2 4.3 4.1 5.2 3.6 96 11.6 10.8 6.4 


Land values in the six largest cities have outpaced the CPI by twenty times 
since 1955. In the Ginza district of Tokyo each square meter of land is worth over 
$200,000 U.S. Choice downtown land in Tokyo goes for the equivalent of nearly a 
billion dollars an acre. At neighboring land prices, the value of land under the 
Emperor's palace and garden in Tokyo equals that of all California or of Canada. 
The golf courses of Japan alone are worth more then the entire Australian stock 
market, some A$250. The total land value in Japan in 1990 was about 4.1 times 
that of the whole United States. Japanese land was worth some ¥2180 trillion as 
of the end of 1989. This compares with a value of ¥1050 trillion at the end of 1985. 
Using an exchange rate of ¥143.76 per dollar at the end of 1989, gives a land value 
of $15.16 trillion. As of September 1990, all land had an index of 203.1, up 16.2% 
from September 1989. With an end of 1990 exchange rate of 135.40, total Japanese 
land values were in the $18.7 trillion range in late 1990. The average acre of land 
in Japan is worth fully 100 times the average acre in the U.S. So even though the 
U.S. has about 25 times more land than Japan, its current total value is less than a 
fourth as much. Essentially half the world's land value at 1987-90 prices is 
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accounted for by Japanese land! It also accounts for about 20% of the total asset 
value in the world. Simple houses in Tokyo rent for more than $10,000 per 
month and cost in the millions. Office space for sale in Tokyo's financial district 
costs nearly $75,000 per square foot. Some luxury apartments in Tokyo rent for 
well over $20,000 per month. Figure 2 compares land prices throughout Japan, in 
1988 with Tokyo normalized at 100. Osaka was then 62, Nagoya, 27, and most 
other metropolitan areas in the 12 to 18 range. Figure 3 compares land value by 
region from 1983 to 1987. 


Figure 1: Land price indices for industrial, residential, commercial and all land 
and annual rates of price change for all land, 1955 to 1990 


a. In the six largest cities 
(semi log scale) 
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b. In the entire country (140 cities) 
(semi log scale) 
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Source: Japan Real Estate Institute 


In 1988 Tokyo's land value alone was about $7.7 trillion, or about half the land 
value of the whole country. To understand how much this is we can do a 
idealized experiment. Let's borrow on it up to 80% of its value. Banks in Tokyo 
commonly provided such loans based on land security until the high interest 
rates of 1990. From 1987 to 1989, the interest rates on loans secured by land were 
5.7% and 6.6% for variable and fixed rate loans, respectively. We would then 
have almost enough money to purchase all the land in the U.S. for $3.7 trillion 
and all the stock on the New York, American and NASDAQ over-the-counter stock 
exchanges for about $2.6 trillion in an all-cash transaction. Obviously, one could 
not sell all of Tokyo's land for $7.7 trillion quickly, nor would a group of banks 
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undertake such a large loan, but this was the value of land prices in fiscal 1987. In 
Tokyo about 2% of land changes hand each year. The price is kept up and bid 
higher because of the excess of demand over supply. 


Figure 2: Housing Land Price Index 
Tokyo = 100 (¥507,300/ m2), Osaka=62; Nagoya=27, Other Areas=12-18 
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Figure 3: Land Value by Region in Japan 
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Source: Economic Planning Agency reported in Canaway (1990) 


Figure 4 shows that as of 1985 a staggering 56% of the national wealth of Japan 
was land. The current percentage may be even higher since there was a huge 
price increase in 1986 and steady rises since then. 

Land turnover is very small as the Japanese believe in holding land whenever 
possible this is reinforced by the tax system which encourages the purchase of 
more land and discourages land sales. The population in per unit of habitable 
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area is thirty times higher in Japan than the U.S. The GNP and energy 
consumption per habitable area are also much much higher in Japan than in the 
U.S. (though the energy per unit GNP is much lower in Japan), see Table 3. This 
puts upward pressure on land prices. 


Figure 4: Composition of National Wealth in Selected Countries 
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Source: Economic Planning Agency, Japan 


Table 3: Comparison of Fundamentals, Japan and the U.S., 1989 


Japan as 
Japan U.S. % of U.S. 
Population, millions 120 239 50.21 
Total area (1000 sq km) 377 9373 4.02 
Habitable area (1000 sq km) 80 4786 1.67 
Population per habitable 1500 50 3000.00 
area (pop/sq km) 
GNP per habitable area 16.90 0.80 2112.50 
(million $/sq km) 
Energy consumption (tons 4650 390 1192.38 


oil equivalent/ sq km) 


Source: Daiwa Securities America, Inc 


If there is to be a major stock market crash in Japan it may well start with or be 
linked to land values. We study the link between the stock market and land 
prices in this paper. Central Tokyo land price increases were relatively firm in 
1988-1990 but large increases continued in the suburbs and in other cities. Will 
property prices crash at some later date? It is hard to say, but with the bulk of the 
property controlled by the major companies and the government, with the tax 
structure and the incentives favoring land holding, and with buying by both 
individuals and institutions, the prices may well stay at these lofty levels. 
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Boone (1989) developed several models in an attempt to rationalize the high 
land values in Japan from an economic point of view. He found that if Japan's 
GNP growth exceeds that in the U.S. by about 2% per year forever, then land 
prices 100 times higher in Japan than in the U.S. are consistent with the economic 
model. He also developed a simple model for the relative land price in Japan 
versus that in the U.S.: 


. hee Land value Rents GNP 
Relative land price in Japan = Rents x GNP X Tand 
1 
= Fi x 1 x b, 


where a is essentially the ratio of rents in Japan for comparable properties which 
are known to be 0.25 to 0.50 in the U.S., b is the GNP to land estimate which 
ranges from 20-30 (Table 2 estimates this at 21 in 1989), and rents/GNP= 1 is 
consistent with the Cobb-Douglas production model that Boone assumes. Using 
these a's and b's gives relative land prices close to the actual ratio of 100. 


JAPANESE HOUSEHOLDS 

The vast majority of the wealth of Japanese households is contained in their 
land and buildings. This constitutes nearly two thirds of their assets. Various 
savings deposits amount to about 14%. Rates of return on these savings accounts 
are regulated, change infrequently and have been low - about 4% for the best 
investments- but this income is usually not taxed. Deregulation in 1989 and 1990 
has made available acounts paying 7% or more, where the income is taxable. 
Securities, insurance and pension assets are only about 6.9% of the Japanese 
wealth and the proportion of this that is corporate stock is astonishingly low, 
some 0.3%. Government bonds and bank debentures each accounts for 0.9%, so 
the largest bulk of the savings goes into cash savings instruments. This is 
partially explained by the practice of financing Japanese corporate needs with bank 
loans rather than equity. Household assets are growing at about $1.83 billion U.S. 
per day (see Ziemba and Schwartz, 1991b). The excess is going mostly into savings 
deposits much of which is then made available largely to corporations for loans to 
purchase assets in the stock market, land, bonds, and overseas investments. 

Households in the U.S., in contrast, have their assets concentrated in time 
deposits (23.2%), insurance and pension funds (13.1%) and securities (19.7%) 
along with residential (23.4%), with land comprising only 7.1% of total assets. 
Table 4 details the differences in the household shares in Japan and the U.S. using 
averages for 1974-83. Consumer durables are one third as widespread in Japan 
compared to U.S. households, which is not surprising considering the consumer 
oriented nature of the U.S. economy. Space is a major factor in this. In Japan 
there simply is little space to store things, hence much less is purchased and 
stored. Moreover when new items are purchased old ones are trashed or moved 
out elsewhere. 
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Table 4: Portfolio Shares of Japanese and U.S. Households, Averages for 1974-83 


Japan U.S. 
Cash 0.8 na 
Demand Deposits 1.2 3.4 
Time Deposits 7.9 23.2 
Postal Saving 4.9 - 
Trust 1.8 - 
Insurance and Pensions 3.8 13.1 
Securities 3.1 19.7 
Residential Structures 11.6 23.4 
Land 53.4 71 
Consumer Durables 4.0 11.9 
Noncorporate Structures 
equipment & invent 7.3 6.1 


Source: compiled in Noland (1988) 


Figure 5 shows the volume of real estate loans. Of particular interest is its 
variability and predictive power for leveling off periods. Rents shown in figure 6 
are rising slower than land prices. Hence one expects increases in valuation to 
provide profits. That is, the short-run losses which are tax deductible off income 
will be made up by long-term capital gains which are taxed at lower rates. For 
additional discussion on these land prices issues, see Boone (1989), Canaway 
(1990), Cults (1990), Fingleton (1990) and Flack (1990). 

High interest rates which led to a sharp fall in stock prices in 1990 have not led 
to any decline in land prices as shown in Table 2. However, there was a sharp 
decline in speculative land such as golf course membership and condos, see 
Figure 13 below and Stone and Ziemba (1990). As interest rates rise, land demand 
falls but in Tokyo, with virtually no new supply, demand still greatly exceeds 
supply. At the same time supply declines with higher interest rates as 
development costs are curtailed. All the incentives favor holding land and not 
even developing it. As Canaway (1990) has pointed out, land held less than five 
years is taxed at fully 52% of its sale value. Meanwhile, yearly taxes paid to hold 
land are about 0.05 to 0.10% of current value. Even upon death it pays to borrow 
money which is deductible at full value while land is valued at about half its 
market value. Hence inheritance taxes are minimized. Canaway argues that in a 
major crash the stock market will go first, then the economy and finally the land 
markets. The results in the paper are consistent with this view. 
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Figure 5: Real-Estate Loans in Japan 
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Figure 6: Residential Land Prices vs Rents 
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Source: Japan Real Estate Research Institute, Management and Coordination Agency 


LAND AND THE NSA CORRELATIONS 
The growth in the Nikkei stock average and its declines are discussed by Schwartz 
and Ziemba in the preceeding article in this volume. 

This paper uses the NSA to study the relationship between the levels of stock 
and land prices in Japan. The NSA is the most popular index and presumably 
reflects investor sentiment in the economy, so it might relate to land prices at 
least as well as the Topix. Since the Topix and NSA have a very close correlation, 
one would not expect the results to be much different anyway. For more details 
and results on the Japanese stock market, see Elton and Gruber (1989), and Ziemba 
and Schwartz (1991a), and the papers in this volume. 

The biannual data from 1955 to 1988, yields the following correlation matrix 
for the NSA, and land price indices for all land, all commerical land, housing 
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land, industrial use land throughout Japan and these latter four indices for the six 
largest cities. 


Table 5: Contemporaneous correlations between the NSA and various land 
indices. 


Nikkei Land Indices Throughout Japan Indices in argest Cities 
Dow ALL COML HOUSE INDL ALL COML HOUSE INDL 
1 2 3 4 5 6 7 8 9 

1.000 
0.858 1.000 


0.857 0.998 1.000 

0.874 0.995 0.989 1.000 

0.820 0.996 0.997 0.984 1.000 

0.963 0.958 0.960 0.961 0.938 1.000 

0.988 0.879 0.885 0.885 0.850 0.978 1.000 

0.956 0.965 0.962 0.974 0.942 0.996 0.964 1.000 

0.889 0.991 0.995 0.981 0.987 0.978 0.919 0.975 1.000 


OANA ALEWN e 


All the correlations of land with the NSA are high. The lowest is 0.82 for 
industrial land throughout Japan. The six largest cities have the highest land 
prices and their land values correlate very closely with the NSA. Indeed all land 
is at 0.963, housing at 0.956 and commerical land at 0.988. Industrial land, much 
of which is lower priced large acreage away from the major cities, correlates less 
with the NSA. 

The concern is with causality between land and the NSA, so we record the lag 
behind and future correlations. The six month prior lag behind index values for 
the land versus the current NSA are shown in Table 6. 


Table 6: Correlations between current NSA values and lagged behind land values 


. ice: ro t Ja Land Indices in 6 Largest Cities 
Current Six Months Before Six Months Before 
NSA ALL COML HOUSE INDL ALL COML HOUSE INDL 
1 2 3 4 5 6 7 8 9 
1.000 
0.853 1.000 


0.847 0.998 1.000 

0.873 0.995 0.988 1.000 

0.816 0.996 0.998 0.983 1.000 

0.945 0.970 0.969 0.973 0.953 1.000 

0.978 0.905 0.906 0.911 0.881 0.980 1.000 

0.940 0.972 0.966 0.982 0.951 0.996 0.968 1.000 

0.868 0.993 0.996 0.982 0.992 0.980 0.929 0.974 1.000 


WOONauRwWNH 


These correlations are only slightly lower then the contemparaneous 
correlations. Again commercial land in the six largest cities has by far the highest 
correlation. 
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Lagging forward yields the current NSA and the six month future land index 
values in Table 7. 


Table 7: Correlations between current NSA values and future land values 


a ic ices j est Cities 
Current Six Months Later Six Months Later 
NSA ALL COML HOUSE INDL ALL COML HOUSE INDL 
1 2 3 4 5 6 7 8 9 
1 1.000 
2 0.863 1.000 
3 0.868 0.998 1.000 
4 0.875 0.996 0.989 1.000 
5 0.824 0.996 0.996 0.984 1.000 
6 0.976 0.946 0.951 0.948 0.923 1.000 
7 0.993 0.860 0.871 0.864 0.827 0.978 1.000 
8 0.966 0.960 0.960 0.968 0.935 0.995 0.960 1.000 
9 0.915 0.984 0.990 0.975 0.977 0.979 0.923 0.978 1.000 


These correlations are higher than the lag behinds and the contemparaneous 
and suggest that the NSA's current value predicts land values in the future better 
than the reverse. The correlation with commercial land six months later in the 
six largest cities is an astounding 0.993, or an R2 of 0.986 for a regression fit. Hence 
the level of the NSA explains nearly 99% of the biannual variation of the level of 
commercial land prices in Japan's six largest cities in the past 34 years! 

Table 8 has the correlations with the NSA and commerical and in the six 
largest cities with lag behind of six (-1) or twelve months (-2), and lag forward of 
six (+1), twelve (+2) or eighteen (+3) months, plus the contemparaneous values, 
again with current NSA. 


Table 8: Correlations between current NSA values and lagged behind commercial 
land values 


Current Commercial Land Indices in an's Six Largest Cities 
NSA -2 -1 Current +1 +2 +3 
1 2 3 4 5 6 vA 

1 1.000 

2 0.949 1.000 

3 0.978 0.996 1.000 

4 0.988 0.985 0.996 1.000 

5 0.993 0.964 0.982 0.994 1.000 

6 0.989 0.943 0.966 0.983 0.995 1.000 

7 0.985 0.926 0.950 0.971 0.987 0.996 1.000 


The best contemparaneous fit is with commercial land in the six largest cities 
as shown in Figure 7. The regression equation is (with t statistics in brackets): 


252 Calendar Anomalies and Arbitrage 


58 W.T. Ziemba 


NSAI =) -818.63 + 75.23 Coml Land, 
(-5.19) (50.70) 
with R2 = 0.975, adjusted R2 = 0.975, DW = 0.468, autocorrelation = 0.759, and SS = 
891.21. The t-statistic of over 50 indicates the high degree of confidence that this 
variable's movements relate closely to those of the NSA. However, the Durbin- 
Wtson statistic is asymptotically equal to 2(1-p) hence at 0.468 there is significant 
positive serial correlation. 


Figure 7: The NSA and Commercial Land Index Values 
in the Six Largest Cities, 1955-1988 


NSA 
In this an succeeding figures, a solid line represents 
30000 the actual NSA plotted at six month intervals 
and the dotted line the prediction equation 
also plotted at six month intervals. / 
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Using commerical land in the six largest cities to explain the level of the NSA 
leaves little room for earnings to explain much. Indeed, adding earnings six 
months ahead, which would be a proxy for current earnings estimates, results in 
an equation with a non-significant earnings variable (although it has the expected 
sign), a lower R? and only a slightly lower sum of squared errors. The equation 
is: 


NSA; = -1297.57 + 75.77 Coml Land, + 36.18 EPSt41, 
(-2.79) (34.60) (0.861) 


Chapter 10: The Chicken or the Egg 


253 


Land and Stock Prices in Japan 59 


with R2 = 0.971, adjusted R2 = 0.970, DW = 0.476, autocorrelation = 0.759 and SS = 
870.38. Again the low Durbin-Watson value indicates positive serial correlation 
of errors. 

The earnings per share variable is significant with the log model: 


log NSA; = 3.008 + 0.8867 log Coml Land; + 0.6010 log EPS. 
(10.60) (26.60) (4.58) 


However, the fit is not as good, with R2 = 0.950, adjusted R? = 0.948, DW = 0.172, 
autocorrelation - 0.913, and SS = 0.24293. 

Figure 8 shows how much poorer this fit is compared to Figure 7. 

The log model with commerical land in the six largest cities does not fit nearly 
as well as the linear model. This model is 

log NSA; = 4.189 + 0.9734 log Coml Landt, 
(33.2) (31.0) 

with R2 = 0.936, adjusted R? = 0.936, DW = 0.120, autocorrelation = 0.940, and SS = 
0.27737. 


Figure 8: The NSA and Commercial Land Index Values in the Six Largest Cities 
and Six Month Future Earnings, Log Model Predictions, 1955-1988 


1955 58 68 62 64 66 68 78 72 74 76 78 8ð 82 84 86 88 


Using commerical land, the wholesale price index is now investigated. The 
importance of the wholesale price index in Japan can be gleaned from Figure 9 
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which shows that there is price parity at 1988-89 values of the yen if one uses this 
index. This is a painful reminder to all consumers in Japan of the extraordinary 
series of markups before a product the marketplace. The yen at 125-140 is greatly 
overvalued on a price parity basis for retail items compared to the U.S. dollar. 
(See Balassa and Noland, 1988, and Ziemba and Schwartz, 1991a, for surveys of 
these calculations.) See the discussion on a wholesale basis the yen is not 
overvalued. This should be noted by those who advocate a lower dollar to "solve 
the trade deficit problem" with Japan. With a higher yen Japanese manufacturers 
simply modified their operations to make their dollar costs lower. Indeed, as the 
yen rose in the post-1985 period Japanese manufacturers moved in this direction 
strongly. It is now commonplace in Japan for companies to list the yen/dollar 
exchange rate at which they will still be profitable and the percent of capacity at 
which this would occur. Some companies have their breakeven as low as 70¥/$ 
with twenty percent capacity. See the discussion in Ziemba and Schwartz (1991a). 


Figure 9: Yen-Dollar Exchange Rate and Purchasing Power Parity 
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Source: Yamaichi Research Institute, based on "Needs" 
published by the Nihon Keizai Shimbun Inc. and other data.. 


The prevailing view is summed up by Akio Morita, the Chairman of Sony, "A 
stronger yen would not help one jot: they will pay more for the same Japanese 
products and we will buy up America even more cheaply.” For an econometric 
analysis of the low dollar scenario see Marris (1989, 1987). See also Bergsten 
(1985). Marris' 1985 "predictions" were remarkably accurate during the fall 1985 to 
end-1987 period, but the recession he forecast did not come true. From 1988 to 
1990 the yen/$ rate has been in a trading range of 120-160. Marris and Bergsten 
among others are still advocating a weak dollar to help resolve the trade deficit. 
That may help. But I take the view, held my many Japanese, that the 
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econometrics simply will not work well as the structural characteristics of the 
Japanese productive economy are changing. What the U.S. needs more than a 
low dollar are better products that U.S. and foreigners want to buy, more savings 
and less consumption induced by higher taxes. 

Using the wholesale price index and commerical land in the six largest cities 
one obtains the following equation for the level of the NSA with both coefficients 
highly significant and an R2 of 98%: 


NSA, = -657.18 - 51.37 WPI, + 74.75 Coml Landt 
(-4.40) (-3.84) (55.20) 
R? = 0.980, adjusted R2? = 0.979, DW = 0.592, autocorrelation - 0.694, and SS = 
809.57. This is displayed in Figure 10. 


Figure 10: The NSA and Commercial Land Index Values 
in the Six Largest Cities and Wholesale Price Indices, 1955-1988 
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One obtains a slightly better equation, shown in Figure 11 and a near perfect fit 
from 1985-1988 when there was a steep increase in both land and stock values, by 
using commerical land lagged six months ahead. One then has 
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NSAt = -453.85 - 24.30 WPI + 65.94 Comml Landt41 
(-3.91)  (-2.28) (69.8) 
with R? = 0.987, adjusted R? = 0.987, DW = 0.339, autocorrelation = 0.824 and SS = 
642.14. 
Although not needed to predict the NSA, earnings per share does have the 
right coefficient sign and the coefficient is significant in both of the latter 
equations. For example, with contemparaneous commerical land 


NSA, = -1581.13 - 62.59 WPI + 72.17 Comml Land; + 89.58 EPS; 
(-3.71)  (-4.53) (41.9) (2.30) 


with R2 = 0.9815, adjusted R2 = 0.9806, DW = 0.683, autcorrelation = 0.650 and SS = 
783.63. 


Figure 11: The NSA and Commercial Land Index Values in the Six Largest Cities , 
Lagged Six Months Forward Wholesale Prices and Lagged NSA Indices, 1955-1988 
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Adding lagged NSA index values to the commerical land and the wholesale 
price index gives an R? above 99%. Leading or lagging or contemparaneous 
commercial land makes little difference in the R? fit but the coefficients change 
drastically. The commercial land six months ahead has the best fit, with good 
Durbin Watson statistics, as shown in Figure 12. 
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The three equations are 


NSA; = -287.08 - 26.40 WPI, + 23.85 Coml Landų1 + 0.7318 NSAqt 
(-3.56)  (-4.03) (5.18) (9.56) 


R? = 0.9942, adjusted R? = 0.9940, DW = 1.51, autocorrelation = 0.212, and SS = 
392.02. 


NSA; = -86.54 - 29.97 WPI + 3.64 Coml Land, + 1.0567 NSAt-1 
(-0.817) (-3.64) (0.548) (10.8) 


R2 = 0.9930, adjusted R2 = 0.9927, DW = 1.53, autocorrelation = 0.209, and SS = 
483.00. 


NSA = -46.78 - 31.15 WPI + 0.3425 Comml Landt-1 + 1.1043 NSAt-1 
(-0.429) (-3.53) (0.0575) (14.4) 


R2 = 0.9932, adjusted R2 = 0.9928, DW = 1.61, autocorrelation = 0.175, and SS = 
477.09. 


Figure 12: The NSA and Commercial Land Index Valuesin the Six Largest Cities, 
Lagged Six Months Forward, Wholesale Prices, and Lagged NSA 1955-1988 
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It is not very satisfying to use lagged NSA to predict current NSA. Except 
when using the future commercial land it takes over the equation. Indeed with 
lagged behind or contemporaneous commercial land, even that variable becomes 
insignificant. 
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We have the strong conclusion that the NSA stock index value is intimitely 
tied to the values of land in Japan, particularly commerical land in the six largest 
cities, and where these prices are headed in the next six months. Hence if the 
commonly discussed crash that many westerners see as inevitable happens, it will 
likely be tied to a simultaneous crash in land values. However, in 1990 there was 
a sharp drop in stock prices while land prices increased. 

Using bi-yearly data and price levels rather than price changes makes the 
analysis simpler and the predictions more accurate. This approach can be severely 
criticized since one is in effect using time as an explanatory variable. Nelson and 
Kang (1984) discuss this. This bias leads to lower variances and hence greater than 
true significance. However, the results are so strong that they seem to stand up 
well. Our purpose is modest, to simply ascertain whether or not land price levels 
were instrinically related to stock price level and which economic indicator seems 
to lead the other. The results seem to clearly indicate this dependence and the 
stocks leading land causality seems clear. But one realizes that the prediction of 
the changes even over six month periods is not that good by comparing Table 2 
and Table 7 in the Ziemba and Schwartz paper in this volume. While there were 
twenty-two stock price declines of 10% or more there was only one land price 
decline of about 5% in 1975-77, the second oil crisis period. Roll (1988) discusses 
the ability to predict stock price changes on daily or monthly basis using CAPM 
and APT models. The fits measured by R2 are in the 0.20 and 0.35 range for daily 
and monthly data, respectively and higher for quarterly returns. Stone and 
Ziemba (1990) have investigated such price change correlations for land and stock 
prices using quarterly data. They found that stock price changes do lead land price 
changes using Granger-Sims causality tests. The P-values for the hypothesis that 
Topix does not lead all land is only 0.0000149 versus 0.15872 for the reverse 
hypothesis that all land leads the Topix. Canaway (1990) has estimated that the 
Topix lead land by eleven months during the period May 1985 to April 1989. 

Stone and Ziemba concluded that: (1) stock prices are much more volatile than 
land prices; (2) despite their high level, the main increase in land prices was 
before 1971; the 1986-88 rise was high but the earlier rises were cumulatively 
much larger; (3) in late 1990 land prices were not falling but speculative land 
investment in condos and golf course membership, etc. fell sharply since March 
1990. These investments may be highly related to small stocks; (4) golf course 
membership price changes do not usually lead the stock market but they did for 
the 1987 crash; (5) golf course membership price changes do lead land price 
changes; (6) the golf course membership prices increased more rapidly than stock 
prices which in turn outpaced land price increases. As of March 1991 there was a 
considerable gap between Tokyo golf and the overall land market; see Figure 14. 
In the past the previous gaps particularly in 1987, have been quickly closed; and 
(7) despite a great desire of the government to cool down the land market and to 
try to engineer a 20-30% fall, extreme demand cash flows, cultural aspects and 
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unwillingness of the Diet to pass any serious tax laws that would probably 
generate such a fall, a great crash is unlikely. 


Figure 13 
The NSA Index versus the Nikkei Golf Membership Index, 1981:4-1991:1 
1000 
900 A, 
800 \ 
Golf led 
700 Oct 87 fall Na k 


81:4 82:4 83:4 84:4 85:4 86:4 87:4 88:4 89:4 90:4 


Source: Stone and Ziemba (1990) using data from th Nihon Keizai Shimbun, Inc. 


Stone and Ziemba (1990) found that the dynamic regression model shown in 
Table 9 predicts land price data using past stock price returns. 
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Table 9 
The Dynamic Regression Model to Predict Land Price Changes Using Past Stock 
Price Returns 


Variable Coefficient Standard Error t-Statistic Significance 
Constant 0.9689 0.4041 2.3980 0.0160 
Topix (-4) 0.0128 0.0072 1.7880 0.0740 
Topix (-3) 0.0355 0.0108 3.2910 0.0010 
Topix (-2) 0.0398 0.0108 3.6650 0.0001 
Topix (-1) 0.0161 0.0070 2.2920 0.0220 
Auto (-1) 1.6509 0.0833 19.7970 0.0001 
Auto (-2) -0.8001 0.0832 -9.6160 0.0001 
R2 0.954 

Adjusted R2 0.949 

Akaike Criterion (AIC) 0.464 

Schwarz Criterion (BIC) 0.526 

Durbin Watson 1.605 

RMS Error 0.411 


Source: Stone and Ziemba (1990) 


Figure 14 shows the results of the model in the sample estimation period, 
1972:2 to 1989:1, and in the forecast period, 1989:2 to 1990:3. 


Figure 14 

Prediction of Land Price Models In and Out of the Sample Estimation Period, 
Quarterly Data, 1972:3 to 1989:1 and a Forecast Period of 1989:2 to 1990:3 for the 
Dynamic Regression Model 
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Abstract: This paper investigates evidence on several seasonal regularities in the security price 
returns on the Tokyo Stock Exchange. The study uses data on the NSA and TOPIX market 
indices from 1949-88. Results are presented concerning monthly, turn-of-the-month and first- 
half-of-the-month, turn-of-the-year, holiday and golden week effects on the TSE. 


Keywords: Seasonal regularities, monthly effects, turn-of-the-month effects, turn-of-the-year 
effects, holiday effects, golden week effects, NSA market indices, TOPIX market indices. 


1. Introduction 


Research in English on Japanese anomalies is quite recent. Part of the 
reason for this is a lack of interest in such studies by the big Japanese 
brokerage firms who are unaware and suspicious of their potential use. These 
firms do not like to publish their research findings even in Japanese. Recently, 
however, U.S. and Japanese researchers have been studying these markets. The 
thrust has been mainly to ascertain the similarities and differences with the 
analogous results in U.S. markets. Kato, Schwartz and Ziemba (1989) have 
surveyed the research and presented new results on day-of-the-week effects in 
Japanese security markets. Other aspects of Japanese security markets are 
discussed in Elton and Gruber (1989), Ziemba and Schwartz (1991) and 
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Ziemba, Bailey and Hamao (1991). The U.S, literature on seasonal regularities 
is surveyed in Schwert (1983), Keim (1986), Jacobs and Levy (1988) and the 
books by Dimson (1988) and Ziemba (1992). 

To begin our study of seasonal regularities and to provide background for 
the reader, the paper begirs with a survey of recent work on the January, 
monthly and size effect literature which is based on data for subsets of the 
1949-88 periods for the first section of the (TSE). Since the first section of the 
TSE has about 86% of the trading values and volume, its study provides a 
good measure of the whole market. As of December 1988 the first section of 
the TSE had a capitalization of about ¥ 450 trillion which was more than the 
value of all the stock exchanges in the U.S. This survey is then complemented 
by evaluating the monthly effects over the whole sample period. New results 
on the other effects are discussed in succeeding sections of the paper. The 
basic efficient markets working hypothesis is that all days are equivalent and 
have the same mean returns. The seasonality results show that there are 
significant departures from this. An attempt is made to discuss why such 
departures occur for the various effects. The answers seem to be a combina- 
tion of cash flows, institutional and cultural factors and differences in risk. 


2. The January and monthly effects 


There seems to be a January effect in Japan similar to that in the U.S., 
Canada and many other countries. In Japan, the effect is not based on tax loss 
selling. For individuals, there were no taxes on capital gains or credits on 
losses during the period of this study. Corporations must pay taxes on capital 
gains but each firm can choose its own tax year and the majority are in March 
not at the turn of the year. 

In addition to excess gains in January particularly for small stocks, there 
are excess gains in June for small stocks. A major reason for these gains seems 
to be the large semi-annual bonuses paid by most Japanese companies in 
December and June. The precise dates vary but it is typical to pay these 
bonuses early in the month. For example, the big brokerage firms pay these 
bonuses near the beginning of the month. These bonuses can amount to as 
much as three months salary and they are steeped in tradition. They provide a 
great measure of flexibility to Japanese corporations facing economic swings. 
Employees purchase gifts - called Ochugen in the summer and Oseibo in the 
winter. They also invest some of their considerable savings in the stock market 
and this boosts the prices. Earnings forecasts may also be a factor. Corporate 
officers often make their earnings forecasts in May and financial analysts 
make theirs in March, June, September and December; see Kunimura (1984). 
Darrough and Harris (1991) and Ziemba (1990) show the strong effects of 
earnings on stock prices. Whether or not the timing of earnings forecasts 
injects a seasonality effect in the market is not well understood. There is, 
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however, a tendency to delay bad news and accelerate good news which may 
be a factor in the turn-of-the-month effect. 

Kato and Schallheim (1985) studied the monthly returns in Japan during 
the twenty-nine year period 1952 to 1980. The number of firms with clear data 
ranged from 529 in 1964 to 844 in 1980. Those firms that were delisted or had 
other data irregularities had similar monthly effects to those firms in the 
sample. Equally and value weighted indices were constructed from the Nissho 
Monthly Stock Price file (1952-80) and the Nikkei Needs Financial Data file 
(1964-81). They found that there was a small firm effect: the average monthly 
return for the equally weighted index (EWI) was 0.42% higher than the value 
weighted index (VWI). This yearly edge of (1.0042)'? — 1 = 5.16% is similar to 
that found in the U.S. in this period. The 5.16% is an overstatement of the 
actual buy and hold mean return difference due to the bias discussed in Roll 
(1983b). However, only the post 1964 period has the small firm effect in 
Japan. This is similar to the U.S. [see, e.g. Ziemba (1992) for precise results 
and references]. In Japan this also corresponds to the opening up of the 
Japanese economy to foreign investors. There is also a strong January seasonal 
effect in both the equally and value weighted indices of 7.08% and 4.48%, 
respectively, relative to the other months. The mean return difference was 
larger in the 1952-63 period, 8.27% and 5.99%, respectively, than in the 
1964-80 period, which gained 6.24% and 3.41%, for the EW! and VWI, 
respectively. From 1964 to 1980 Kato and Schallheim found the mean return 
differences in January returns to be size dependent. For the equally weighted 
index, the excess gain in January relative to the other months ranged from 
8.68% for the smallest decile firms and 3.18% for the largest decile firms with 
the other firms being 


Largest 2 3 4 5 6 7 8 9 Smallest 
3.18 3.82 4.42 6.25 6.12 7,16 8.15 8.20 8.55 8.68 


For the value weighted index the January versus the rest of the year mean 
return differences were 


Largest 2 3 4 5 6 7 8 9 Smallest 
— 0.98 —0.19 0.61 1.19 2.18 3.34 4.05 4.38 4.87 5.16 


Analogous to the U.S. Value Line-S&P500 spread gains in January, see 
Clark and Ziemba (1987) and Vander Cruyssen and Ziemba (1991), Kato and 
Schallheim computed the mean return difference between the equally and 
value weighted indices by portfolio size. This is not the best measure for the 
small versus large firm spread but one that gleans the effect. The spread in 
January excess returns relative to the CAPM model using equally and value 
weighted indices were 


Largest 2 3 4 5 6 7 8 9 Smallest 


EW — 2.48 —2.01 -144 -084 -052 053 108 140 1.82 2.02 
YW — 0.98 —0.19 0.61 1.49 2.18 3.34 405 4.38 4.87 5.16 
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Fig. 1. Mean daily returns by month in Percent for various small stock portfolios versus the 
TOPIX, 1975-88. 


Source: Yamaichi Research Institute. 


The monthly size effect is shown in fig. 1 using our updated data from 1975 to 
1988 using daily returns for the smallest 10, 25, 50 and 300 stocks and the 
TOPIX. Kato and Schallheim also found elements of June effect. So does the 
more recent data. Both January and June have significantly higher returns 
than the other months for small firms. Actual calculations appear in Kato 
(1990b) and Jaffe and Westerfield (1985a, b). 

Another study (unpublished and in Japanese) by Horimoto (1988) based on 
monthly return data on the equally weighted index is described in tables 1 and 
2 for all the stocks on the first section of the TSE from January 1965 to 
December 1987. Table 1 indicates that the smallest decile stocks gained 9.53% 
in January on average which is 7.21% more than the largest decile stocks. The 
smallest stocks also gained 4.34% more, on average, than the largest stocks in 
June. March and December are strong months relatively for the big stocks 
probably because of window dressing and corporate dividend and reporting 
effects. By month the highest average returns on the TSE were in January, 
followed in order by March, December and June. There seem to be two 
windows of opportunity for small stock buyers. The period January and 
February has a higher mean return of about 8.6% and the April to July period 
has one of about 8.2%. Small stocks are not in favor by the big four securities 
companies hence they have not encouraged their clients to bid up these prices 
to close these rather large differences in small versus large firm average 
returns. The data also show clearly the observation that September and 
October are typically weak months for all stocks. Risks in small stocks are 
higher than in big stocks as measured by the standard deviations shown in 


Mean returns in percent of ten size deciles on the first section of the TSE, by month 1965-87. 


Table 1 


Firm Jan. Feb. Mar. Apr. May June July Aug. Sep. Oct. Nov. Dec Mean Stdev St dev 

decile stocks index 

size 

Smallest 1 9.538 2.108 1.289 1.305 2536 5.308 2.759 1.543 -0.426 1.135 1.081 2625 2.567 6107 2.591 
2 7.704 2.379 2.064 0.622 1.818 4436 2.155 0859 —0.261 0.568 1.083 2.350 2.148 5.186 2.126 
3 6.993 1.796 1.908 0697 1.876 3.963 2.374 0.881 0.229 0.327 1.036 2.140 2.018 5.005 1.879 
4 6.303 1568 2.882 0.790 1.042 3.403 0.670 0.877 -90.270 0.545 0.831 2.102 1.729 4.843 1.774 
5 5.931 2.002 3.162 0.681 1.250 2.603 1.126 1.315 0.258 0.224 0.860 2.052 1.789 4811 1.586 
6 4866 1.938 3.194 0.795 1.350 2.390 0.816 1.289 -0001 -—0.365 1.255 2.098 1635 4.513 1.419 
7 4596 1.706 3.315 0.875 0.800 2.506 0.327 1.388 0.580 0.000 0.898 1.759 1.563 4.422 1.338 
8 3.232 1.373 3.527 0.934 0.619 1.454 0.232 1.457 0.830 —0.104 1.215 1.906 1.390 4.100 1.085 
9 2850 1.446 3.663 0.551 0.714 1.281 0.290 0.739 0.742 —0.156 1.579 2.567 1.356 4.351 1.141 

Largest 10 2.332 0.774 3.763 0.934 0.899 0.966 0.461 1.213 1.129 —0.430 1.295 3.142 1.373 4.808 1.163 

Mean 5.435 1.709 2.877 0.818 1.290 2.831 1.121 1.156 0.281 0.174 1.113 2.274 2.757 4.815 1.610 


Source: Adapted from Horimoto (1988). 
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Firm Jan. 

decile 

size 

Smallest 1 4.752 
2 3.747 
3 3.518 
4 2.835 
5 3.107 
6 2.530 
7 2.349 
8 2.478 
9 2.107 

Largest 10 3.777 

Mean 3.120 


Source: Adapted from Horimoto (1988). 


Feb. 


3.897 
3.604 
3.078 
2.516 
3.130 
3.405 
3.232 
3.275 
3.638 
3.896 
3.367 


Standard deviations of ten size deciles on the first section of the TSE, by month 1965-87. 


Mar. 


Apr. 


May 


June 


5.197 
4.399 
4.840 
4.002 
3.255 
3.272 
3.270 
2.753 
2.511 
2.779 
3.628 


Table 2 


July 


Aug. 


8.000 
6.829 
7.276 
7.470 
7.704 
6.838 
6.808 
6.073 
6.204 
5.862 
6.906 


Sep. 


Oct. 


6.648 
5.350 
5.642 
5.169 
5.080 
5.334 
4.835 
4.819 
4.751 
5.249 
5.288 


Nov. 


5.381 
5.546 
4.991 
5.332 
5.553 
5.276 
4.771 
3.677 
5.275 
5.860 
5.266 


Mean 


St dev 
of stock 


5.564 
4.780 
4.645 
4.447 
4.484 
4.238 
4.159 
3.933 
4.147 
4.683 
4.508 


P71 


Saiiavn$ad Jayspus ANIIS asauvdys / vquaiz LM 


897 


aboIIQLy PUD SIYDUWOUY LOPUD 


Chapter 11: Japanese Security Market Regularities 


269 


W.T. Ziemba / Japanese security market regularities 125 


table 2. However, the risks in January and June are lower than in most other 
months and are below the mean monthly standard deviation. They are also not 
much higher than the average standard deviations by month for all ten of the 
deciles, Small firms in Japan in recent years have had low beta values. One can 
conjecture that the low betas are probably due to thin trading and the bias 
that this gives to beta estimates that are not computed using the Dimson or 
Scholes—Williams estimation methods. However, upon calculation these dif- 
ferences are minor. The small stocks simply do not move with the market 
indices in a regular pattern. For details, see Ziemba and Schwartz (1991). The 
advent of programmed trading once futures contracts were available on the 
SIMEX in 1986 and later in Japan has accentuated this disassociation of the 
small stocks from the NSA index. 

For the five-year period July 1983 to June 1988 the average beta for the 
NSA (relative to the TOPIX’s 1.00) estimated by the TSE was 0.84. For the 
largest capitalized stocks it was 1.16, for the medium caps 0.48 and for the 
smallest stocks only 0.37. For this study small firms were defined to be those 
with less than 60 million shares outstanding, medium sized firms with 60 to 
200 million shares and large firms more than 200 million shares. The TOPIX 
index is a value weighted average of all, some 1135, stocks on the first section 
of the TSE. The NSA is a price weighted average of 225 well known stocks. Its 
value is computed like the Dow Jones Industrial Average (DJIA) by adding up 
the values and dividing by the current divisor. More details on these indices 
appear in Ziemba and Schwartz (1991). These results are shown in fig. 2 using 


Mean return 
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Fig. 2. Mean returns by month and decile size for TSE stocks, 1985-87. 


Source: Constructed from data compiled by Horimoto (1988). 
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MRO = Smallest minus largest stock decile 


Fig. 3. Mean return differences by month and decile size for TSE stocks, 1985-87. 


Source: Constructed from data compiled by Horimoto (1988). 


capitalization deciles. Fig. 3 shows the monthly small firm minus large firm 
return differences. 

Kato (1990a) investigated the small-firm effect using daily returns on all 
TSE stocks, namely the value-weighted TOPIX index, from 1974, when there 
were 806 securities, to 1987 when there were 1069 securities. He used yearly 
rebalancing on portfolio size and ranked the securities into five size categories 
with 1 denoting the smallest capitalized stocks and 5 the largest. The average 
firm sizes for these groups were 773 million yen to 32,689 million yen, 
respectively. Table 3 describes the results. 

The small stocks outgain the big stocks by a two to one margin and the gain 


Table 3 
Mean returns for size related portfolios, TOPIX, 1974-1987. 

Firm size Average Mean retum 

market value 

ies Close to Close to Open to 

(¥ million) close open close 
Smallest 773 0.1492 0.1784 — 0.0295 
2 1,716 0.1058 0.1545 — 0.0489 
3 3,275 0.0859 0.1354 — 0.0495 
4 6,554 0.0764 0.0976 — 0.0213 
Largest 32,689 0.0653 0.0483 0.0168 


Source: Kato (1990a). 
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Table 4 
Average monthly returns on a portfolio containing all the stocks in the sample (from 373 to 566 
stocks) over the period January 1955~December 1985. 


Average monthly Sample Equally-weighted portfolio value-weighted portfolio 
PerUrR NEE Sze Average t-Statistic Average t-Statistic 
return return 

AH months 372 1.60% *** 6.40 1.84% *** 7.55 
All months 

but January 341 1.24% *** 4.75 1.61% *** 6.22 
January 31 5.58% *** 13.17 4.36% **™ 9.85 
February 31 1.00% 1.85 0.99% 1.73 
March 31 2.90% *** 3.77 3.85% *** 4.48 
April 31 0.51% 0.57 1.01% 1.23 
May 31 0.18% 0.20 0.86% 0.91 
June 31 2.51% ** 4.59 2.43% **™ 4.09 
July 31 0.65% 0.75 0.44% 0.51 
August 31 0.92% 0.85 1,05% 1.09 
September 31 0.13% 0.17 1.28% 1.85 
October 31 0.52% 0.48 0.49% 0.56 
November 31 1.81% 1.76 2.42% * 2.27 
December 31 2.48% ** 3.02 2.87% ** 3.25 


^ In this and succeeding tables * indicates that the average return is significantly different from 
zero at the 5 percent level, with a two-tail test, ** at the 1% level and *** at the 0.1% level. 
Source: Hawawini (1991). 


is monotonic in size. So the usual size effect was present in Japan during 
1974-87. Like in the U.S. the yearly size effect is not present during 1983-87. 
However, even in this period small firms do have much higher absolute and 
risk adjusted returns in January and June. Interestingly all the gains occur in 
the non-trading period at night as the close-to-open returns are statistically at 
least as high as the total, i.e., close-to-close returns. The returns during the day 
provide no gains at all and in fact are slightly negative except for the highest 
capitalized stocks. This finding is somewhat akin to Keim and Smirlock’s 
(1987) results for U.S. stocks who found that the major gain in the Value Line 
small stock index was mostly at night and there were larger gains during the 
day for the large cap S& P500 index. See Kato (1990a) for an analysis of these 
effects by day of the week in January and other months and Kato, Schwartz 
and Ziemba (1989) for some plausible reasons for this behavior. 

Additional insight has been provided by Hawawini (1988, 1991). His study 
used monthly data for the 31 years January 1955 to December 1985, The firms 
he considered trade on the first section of the TSE. There were 373 firms 
which traded continuously during this entire period and 566 which traded in 
the second half, January 1970—-December 1985. The effect across months 
appears in table 4. January has the highest returns for small stocks (the 
equally-weighted index) averaging 5.58% per month. The big stocks (the value 
weighted index) do nearly as well, returning 4.36% per month. June and 
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Table 5 
Risk and return characteristics of the largest and smallest size portfolios partitioned into four 
sub-portfolios ranked by decreasing magnitude of their beta coefficient risk TSE, January 
1955—December 1985. ° 
Average Sub-portfolios ranked by risk Average monthly return 
portfolio size Risk Size Allvear = January 
eee eae year January 
(in millions) (beta) (¥ millions) 
¥ 164,716 1.44 ¥ 158,275 1.29% 6.20% 
largest 0.89 ¥ 179,708 0.77% 3.33% 
quintile 0.57 ¥ 160,110 0.61% 2.74% 
0.21 ¥ 160,772 0.69% 1.14% 
¥ 4942 1.97 ¥ 5,406 2.02% 12.40% 
smallest 1.37 ¥ 4,776 1.34% 8.16% 
quintile 0.97 ¥ 4,710 1.35% 5.75% 
0.43 X 4,876 1.42% 2.51% 


* All beta coefficients and mean returns are significantly different from zero at the 5% level. 
Source: Hawawini (1991), 


December, the bonus months also have high returns in the 23% range, and 
March has high returns as well, namely 2.90% and 3.85% for the small and big 
stocks, respectively. 

The January effect and the excess gains of small stocks over big stocks are 
beta dependent. The small stocks have higher returns than the big stocks and 
the more so the higher the £ is. The smallest stocks with the highest 8s return 
12.40% in January versus only 1.14% for the largest stocks with the lowest Bs. 
Table 5 shows this effect which is analogous to that observed on U.S. stocks. 
See, Ritter and Chopra (1989). A strategy to exploit this is to invest in small 
stocks in January especially those with high betas and then if your transactions 
costs are small moving into big stocks for the rest of the year. For the period 
1975 to 1984 without transactions costs one has returns in U.S. dollars as 
listed in table 6. 

The strategy of being in small stocks in January and big stocks otherwise 
has lower risk than even big stocks throughout the year and nearly as high 
mean returns as being in small stocks all the time. Adding transactions costs 


Table 6 
Returns from various investment strategies on the TSE, January 1955 to December 1985. 


Strategy Arithmetic Geometric Standard deviation 


mean mean using arith. mean 
TOPIX 18% 17% 18% 
Bottom quintile on TSE 
1st Section 28% 22% 38% 
Bottom quintile in Jan. 
and TOPIX rest of year 22% 21% 14% 


Source: Hawawini (1991). 
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Table 7 
Mean daily return, standard deviation, and percent positive returns by month of the NSA, 
1949-1988. 
Month Sample Mean Standard t-Statistic p-Statistic % Positive 
size return, deviation returns 
% of return, 
% 

All days 11529 0.0482 *** 0.0933 5.56 0.0001 54.3 
January 885 0.1816 *** 0.0926 5.80 0.0001 60.7 
February 905 0.0549 0.0886 1.87 0.0622 56.7 
March 986 0.0457 0.0955 1.50 0.1335 55.0 
April 947 0.0623 * 0.0913 2.10 0.0359 54.2 
May 956 0.0074 0.0835 0.27 0.7843 53.1 
June 1011 0.0641 * 0,0852 2.39 0.0169 56.8 
July 1043 0.0083 0.1012 0.27 0.7904 52.3 
August 1038 0.0790 ** 0.0859 2.96 0.0032 55.2 
September 931 0.0059 0.0820 —0.22 0.8252 51.6 
October 998 0.0088 0.1150 0.24 0.8089 50.9 
November 910 0.0371 0.0879 1.27 0.2031 52.4 
December 919 0.0470 0.1038 1.37 0.1706 53.8 


Source: Yamaichi Research Institute. 


for the extra round trip transaction would reduce the difference in specific 
returns by 1-2% per year, but would not change the conclusion that the 
January small stock, TOPIX rest of year strategy dominates the TOPIX alone 
strategy. Additional research on these types of strategies appears in Ziemba 
and Schwartz (1991). 


3. The monthly effect on the NSA and TOPIX market indices, 1949-88 


Table 7 gives the monthly returns on the NSA‘ from 1949-1988 (Septem- 
ber). The data in this section are over longer periods than that discussed 
above, namely, the entire 1949-1988 sample period. January has by far the 
highest returns and all months are positive except for September which is just 
slightly negative. With January excluded one cannot reject the hypothesis of 
equal mean returns in all months. January has significantly higher returns than 
the other months. The market increased 60.7% of the days compared to 54.3% 
in all months (including January). The other months have similar behavior 
with September and October having the lowest returns. Even then more days 
rise than fall. 


! The NSA is a price weighted average of 225 stocks on the first section of the TSE. It is 


computed like the DJIA by simply adding up the values of the 225 stocks and dividing by the 
current divisor, which was 10.289 as of the end of December 1988. The NSA then had a beta of 
0,84 relative to the TOPIX index. 
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Table 8 
Mean daily return standard deviation, and percent positive returns by month of the NSA, 
1949-1959. 
Month Sample Mean Standard t-Statistic p-Statistic  % Positive 
size return, deviation returns 
% of return, 
% 
All days 3193 0.0565 ** 1.119 2.85 0.0044 53.6 
January 232 0.1990 * 1.302 2.33 0.0207 59.9 
February 239 0.0495 1.284 0.60 0.5519 57.2 
March 256 ~— 0.1284 1.290 -1.59 1128 47.7 
April 247 0.1446 * 1.068 2.13 0.0343 52.2 
May 262 0.0142 0.863 0.27 0.7904 50.7 
June 283 0.0192 0.952 0.34 0.7350 54.4 
July 291 0.0897 1.441 1.06 0.2895 54.4 
August 292 0.2482 *** 0.798 5.31 0.0001 59.6 
September 273 0.0330 0.813 0.67 0.5044 55.3 
October 292 0.0481 0.968 0.85 0.3965 52.1 
November 262 0.0031 0.939 0.05 0.9574 48.1 
December 264 — 0.0460 1.453 — 0.51 0.6077 52.3 


Source: Yamaichi Research Institute. 


To investigate the changing patterns, tables 8-11 consider the past four 
decades. The main differences are: 
(a) During the 1950s August had very high returns, surpassing even January, 
while December was slightly negative and March was significantly negative. 
Both January and August had gains on about 60% of the days, while March 
had losses on more days than gains. 


Table 9 
Mean daily return, standard deviation, and percent positive returns by month of the NSA, 
1960-1969. 
Month Sample Mean Standard = ¢-Statistic p-Statistic % Positive 
size return, deviation returns 
% of return, 
% 

All days 3000 0.0368 * 0.868 2.33 0.0201 52.8 
January 232 0.2170 *** 0.725 4.56 0.0001 62.9 
February 241 0.0271 0.673 0.62 0.5327 52.7 
March 258 0.0483 0.958 0.81 0.4187 51.6 
April 248 0.0465 0.770 0.95 0.3424 51.6 
May 246 — 0.0128 0.864 -0.23 0.8170 53.3 
June 257 0.0906 0.933 1.56 0.1211 56.8 
July 265 — 0.0592 0.920 —1.05 0.2957 51.3 
August 267 0.0139 0.783 0.29 0.7728 48.7 
September 246 — 0.0362 0.856 — 0.66 0.5079 47.6 
October 260 — 0.0135 0.101 — 0.22 0.8298 49.2 
November 240 0.0448 0.944 0.74 0.4631 52.9 
December 240 0.0965 0.868 1.72 0.0866 56.7 


Source: Yamaichi Research Institute. 
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Table 10 
Mean daily return, standard deviation, and percent positive retums by month of the NSA, 
1970-1979. 
Month Sample Mean Standard t-Statistic p-Statistic % Positive 
size retum, deviation returns 
% of return, 
% 

All days 2892 0.0391 * 0.853 2.46 0.014 55.3 
January 224 0.1370 * 0.830 2.47 0.014 59.4 
February 225 0.1075 0.830 1.94 0.053 59.1 
March 252 0.1158 ** 0.064 2.87 0.004 60.3 
April 239 ~ 0.0555 1.016 — 0.84 0.399 51.9 
May 240 0.0219 0.810 0.42 0.676 56.7 
June 251 0.0794 0.834 1.51 0.133 57.8 
July 258 0.0193 0.628 0.49 0.622 52.3 
August 259 — 0.0686 1.089 — 1.01 0.324 55.2 
September 232 0.0749 0.758 0.15 0.880 51.3 
October 249 — 0.0915 0.989 -0.15 0.884 49.8 
November 230 0.0335 0.812 0.63 0.532 54.3 
December 233 0.0977 0.898 1.66 0.098 55.4 


Source: Yamaichi Research Institute. 


(b) In the 1960’s January was very strong, having average returns of about 
0.21% per day and the market advanced about 63% of the time. After January, 
the market had a reasonable showing in the first half and then fell in the 
second half of the year but managed to close positively in December. The 


Table 11 
Mean daily return, standard deviation, and percent positive returns by month of the NSA, 
1980-1988. 
Month Sample Mean Standard i- p- % Positive 
size return, deviation Statistic Statistic returns 
% of return, 
% 

All days 2444 0.0625 *** 0.8242 3.75 0.000 56.1 
January 197 0.1658 *** 0.6825 3.41 0.001 60.4 
February 200 0.0361 0.5209 0.98 0.328 58.5 
March 220 0.1648 ** 0.7471 3.27 0.001 61.8 
April 213 0.1175 * 0.7212 2.38 0.018 62.4 
May 208 0.0060 0.8006 0.11 0.914 51.9 
June 220 0.0738 0.6054 1.81 0.072 58.6 
July 229 — 0.0292 0.7747 — 0.57 0.569 50.2 
August 220 0.1074 * 0.6612 2.41 0.017 57.3 
September 180 — 0.0406 0.8047 — 0.68 0.499 51.7 
October 197 0.0027 1.6574 0.02 0.982 52.8 
November 178 0.0814 0.7784 1.40 0.165 $5.6 
December 182 0.0516 0.5936 1.17 0.242 50.0 


Source: Yamaichi Research Institute. 
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Fig. 4a. Mean return in percent by month of the year, 1949-1988. (a) During January to March on 
the TOPIX Index. (b} During April to June on the TOPIX Index. (c) During January to March on 
the NSA Index. (d) During April to June on the NSA Index. 


Source: Yamaichi Research Institute. 
bonus month of June was positive and the market rose about 57% of the time 


then. The July-November period was very weak. 
(c) In the 1970s January again had the highest returns and again June and 
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Fig. 4 (continued). 


December had strong returns. The second half of the year, July-November 
was again weak with net losses in this period although the market did manage 
to close higher on more than half of the days. February and March were 
strong and had returns above December and nearly as high as January. The 
market was up about 60% in these months similar to January’s performance. 
(d) In the 1980s January again had the highest return but only marginally 
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Fig. 5a. Mean return in percent by month of the year, 1949-1988. (a) During July to September 
on the TOPIX Index. (b) During October to December on the TOPIX Index. (c) During July to 
September on the NSA Index. (d) During October to December on the NSA Index. 


Source: Yamaichi Research Institute. 


above March. July was weak again as was the September—October period. 
Other months, such as April, August, November and December, had strong 
returns. 
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Fig. 5 (continued). 


(e) The decades returned 0.056%, 0.037%, 0.039%, and 0.062%, respectively for 
an average gain over the 39 years of 0.048% per day. 


Figs. 4a—d and a-d provide the monthly returns year by year for the 
TOPIX and the NSA from 1949 to 1988. These results are more variable than 
the aggregated results but yield similar conclusions. 
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4. Turn-of-the-month effect 


In the United States the returns on trading days —1 to +4 of each month 
dominate the other days. This is referred to as the turn-of-the-month effect. 
The return in this period, coupled with that in the second week of the month, 
roughly trading days +5 to +8 or +10, essentially amounts to all the gains 
in the spot stock market in the period of the 1960s, 70s and 80s. See Ariel 
(1987) for spot data for the period 1963-82 and Sick and Ziemba (1991) for 
futures data for 1982-91. The rest of the months’ returns is essentially noise 
and at best provides zero returns. The reason(s) for this effect are not fully 
known but part of the story is likely that people receive their salaries on or 
around the —1 day and they receive their stock account statements so they 
have funds to invest in stocks. There are also portfolio balancing effects. When 
would one expect the turn-of-the-month to be in Japan? Most companies pay 
their salaries on the 25th of the month. Does the turn-of-the-month start then 
and is it similar to that in the U.S.? 

Figure 6 displays the daily returns over the 27 possible trading days each 
month in Japan for the NSA, 1949-1988. There are higher returns in Japan’s 
turn-of-the-month: — S5 to +2,. a seven-day trading period. Each of these days 
has returns of 0.10% or more on average and these returns are all statistically 
significant at the 5% level. The —1 day has very high returns, as in the U.S. 
The mean returns are over 0.22%, making its effect about as strong as a 
pre-holiday, namely providing returns about five times as large as on a typical 
trading day. The first-half-of-the-month effect, namely days —5 to +7, a 
twelve-day trading period, is also present. The fifteen-day period +8 to +17 


OMUZUNDA 
Mean return (%) 


Fig. 6. Mean rates of return in percent by trading day-of-the-month, NSA, 1949-1988. 


Source: Yamaichi Research Institute. 
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Table 12 
Turn-of-the-month data, 1949-1988. 
Trading Sample Mean Standard Maximum Minimum Median /-Statistic 
day size return, deviation return, return, return, 
% return, % % % 
% 

= 5 471 0.0899 ** 0.710 3.650 — 2.578 0.0831 2.75 
-4 471 0.1041 ** 0.853 5.321 —2.853 0.1197 2.65 
-3 471 0.1733 *** 0.984 4.118 — 4.216 0.2365 3.82 
-2 471 0.1334 *** 0.911 4.555 — 6.734 0.1392 3.18 
-1 471 0.2255 ** 0.914 2.750 — 8.686 0.3005 5.36 
1 471 0.0980 * 0.875 4.501 — 3.987 0.1563 2.43 
2 471 0.1006 * 0.884 3.673 — 4.849 0.1313 2.47 
3 471 0.307 0.924 5.633 — 4.368 0.0202 0.72 
4 471 0.0592 1.113 11.149 — 9,997 0.0437 1.15 
5 471 0.0358 0.957 6.307 — 10.649 0.0305 0.81 
6 471 — 0.0005 0.811 4.41 —3,.899  -0,0092 -0.01 
7 471 0.0357 0.838 3.471 — 4.600 0.0573 0.09 
8 471 — 0.0585 0.945 4.554 — 8.218 0.0112 — 1,34 
9 471 0.1065 * 0.940 6.877 — 4.716 0.1085 2.46 
10 471 0.0620 0.852 4.795 — 4.399 0.0570 1.58 
11 471 0.0395 0.813 5.394 — 4.025 0.0566 1.05 
12 471 — 0.0196 0.850 4.681 — 6.970 0.0531 — 0,50 
13 471 0.0115 1.055 11.289 — 7.680 0.0201 0.24 
14 471 — 0.0042 0.912 6.408 — 3,542 0.0218 - 0.10 
15 471 — 0.0306 1.237 6.138 — 14.901 0.0616 - 0.54 
16 471 0.0716 1.148 9.888 — 7.493 0.871 1.35 
17 471 — 0.498 0.888 4.716 — 4.253 —0.0380 -1.20 
18 429 — 0.0207 0.870 3.582 — 4,930 0.0206 — 0.49 
19 350 0.0162 0.841 3.924 -5.032 -0.0344 0.36 
20 229 — 0.0286 0.899 4.604 —4.705 —0.0104 — 0.48 
21 118 — 0.0476 0.354 3.600 — 6.610 0.0680 — 0.46 
22 39 0.0562 0.778 2.008 — 1.732 0.0079 0.45 


Trading days 
lst half 22-7 0.01142 
2nd half 8-21 0.00048 


Source: Yamaichi Research Institute. 


and —10 to —6 amounts to a second half of the month which has returns that 
are at best noise. In fact, these returns are negative, on average. The hypothe- 
sis that these returns are less than or equal to zero cannot be rejected at the 5% 
level. 

It is a difficult to compare the days consistently, as each month in Japan is 
lightly different. For example, Saturdays trade on the first, fourth and fifth (if 
there is one) week of the month. Table 12 shows the —5 to +2 and —5 to +7 
periods more clearly. These periods have the highest returns. The remainder of 
the month, starting either on day +8 or possibly on +12 has zero or negative 
returns. This is striking, since the average day during these 39 years had 
returns of nearly 0.05% per day. 

The numerical data of table 12 appears in fig. 6. If we let day +22 be in 
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our turn-of-the-month, then the 22, —5 to +7 period returns more than 1% a 
month, namely 1.142%, on average while in the +8 to +21 period (+8 to 
+17 and —10 to —6 in the other counting) the returns are slightly positive 
but they are insignificant from zero return of only 0.0482%, on average, which 
is only one in 23.7 times as much as in the turn of the month. The days of the 
week have a considerable effect on mean stock returns; see Kato, Schwartz 
and Ziemba (1989) and Kato (1990a). Hence the turn-of-the-month effect is 
commingled with the days-of-the-week effect. Ziemba and Schwartz (1991) 
present the results of a 52-variable model to separate out these effects to 
develop seasonality calendars to evaluate and rank all the days on a common 
basis. 

Discussions with experienced brokerage executives provided the following 
list of plausible reasons for the turn-of-the-month effect. They involve finan- 
cial flows or institutional arrangements that collectively seem to cause the 
effect. To test these hypotheses rigorously would require considerable statisti- 
cal ingenuity and data collection effort. 


- Most salaries are paid on days 20-25 of the month with the 25th being 
especially popular. This leads to buying pressure on the 25th. 

- There is portfolio window dressing on day —1. 

- Security firms can invest for their own accounts in amounts based on their 
capitalization. Since their capitalization usually rises each month and is 
computed at the end of the month, there is buying as early as on day —3 to 
account for this (because of the three day settlement process on the TSE). 

- Large brokerage firms have a sales push that lasts 7 to 10 days and this 
starts on day — 3. 

- Employment stock holding plans and mutual funds receive money in this 
period to invest, starting around day —3. 

~ People buy mutual funds with their pay which they receive on calendar days 
15 to 25 of the month; then these funds are invested in stocks with a lag, so 
most of the buying occurs on days —5 to +2. 


5. Holiday effects 


There is a holiday effect in Japanese securities similar to that in U.S. 
securities as documented by Ariel (1988). See also French and Roll (1986), 
Lakonishok and Smidt (1989), Zweig (1986) and Ziemba (1991). The mean 
return on all days on the NSA from 1949-1988 is 0.048% per day. On the 
pre-holidays the return is about five times as high: 0.246% per day. Moreover 
the risk as measured by the standard deviation is also lower: 0.794% versus 
0.979% per day, respectively. As in the U.S. there are no abnormal gains on 
the days around holidays except for the pre-holiday trading day. For example, 
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Table 13 

Holiday effects on the NSA, 1949-1988. 
Day Sample Minimum Maxi- Mean Standard t- 

size retum, % mum return, deviation, Statistic 

return, % % 
% 
Pre-holiday 408 — 5.4069 3.0631 0.2461 *** 0.7943 6.26 
Non pre-holiday 7143 —14,9009 11.2893 0.0489 *** 0.9786 4.22 
Days after 
holidays 408 — 8.6856 4.7122 0.0068 0.9893 0.14 


Pre-holidays, 
Fridays and Saturdays 
preceding market 


closing 2268 —6.6101 6.8765 0.1561 *** 0.7242 10.27 
Days after holidays 
and weekends 2268 — 8.6856 5.3938 — 0.0480 * 0.9473 -2.41 


Source; Yamaichi Research Institute. 


the days following holidays had returns of 0.0067% per day on the NSA during 

1949-1988 which is less than a typical day. A regression shows this: 

R = 0.0352 +0.0799 Day — 3 +0.0222 Day — 2 +0.1894 Day — 1 —0.0663 Day + 1 +0.00114 Day + 2 
(3.745) (1.491) (0.424) (3.709) (- 1.334) (0.023) 


where R is the daily return and the effects of trading days —3 to +2 are 
separated out using 0, 1 variables for these coefficients. Only the pre-holiday 
with a mean return of 0.1894% per day is statistically significant with a ¢-static 
of 3.709. 

If we lump Fridays without Saturday trading and Saturdays in as special 
pre-holidays then the average return over the thirty-nine years is still 0.156% 
per day and the trading day after special and regular holidays has negative 
returns namely —0.048%. This is mostly the effect of the negative Mondays. 
See Kato, Schwartz and Ziemba (1989) or Kato (1990a). In Japan, Tuesdays 
are negative, on average, but not when there is Saturday trading the previous 
week. Then Monday is strongly negative, on average. These results are 
summarized in table 13. 

Table 14 shows that pre-holidays improve every day of the week. Lakonishok 
and Smidt (1989) found similar results in their 90-year Dow Jones Industrial 
Average study. Even Mondays and Tuesdays are positive if they are pre-holi- 
days. Wednesdays, Thursdays, Friday and Saturdays have extremely high 
returns if they are pre-holidays. These days have positive returns about 70% of 
the time. Saturday trading has varied considerably over the years with the 
trend towards fewer and fewer open Saturdays. During the sample period, 
Saturdays were open about 90% of the weeks. Saturdays were closed for 
trading as of February 1989. For more on the day-of-the-week effects in 
Japan, see Kato (1990a), Kato, Schwartz and Ziemba (1989) and Amihud and 
Mendelson (1989). 
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Table 14 
Mean daily return, standard deviation and percent positive returns on pre-holidays by days of the 
week of the NSA, 1949-1988. * 


Sample Mean Standard t- P- Median % Positive 
size retum, % deviation, Statistic Statistic return returns 
% 
All Mondays 1937 —0.589 ** 0.9500 -—2.73 0.0064 0.0189 50.9 
All Tuesdays 1959 —0.0419 0.9586 -—1.93 0.0532 -0.0373 47.4 
All Wednesdays 1960 0.1164 *** 0.9722 5.30 0.0001 0.1212 56.9 
All Thursdays 1959 0.0871 *** 1.0234 3.77 0.0002 0.0676 55.1 
Alt Fridays 1959 0.0544 ** 0.9174 2.62 0.0087 0.0700 54.6 
All Saturdays 1755 0.1409 *** 0.7043 8.38 0.0001 0.1483 61.9 
Mon. before holidays 59 0.3106 ** 0.7698 3.10 0.0030 0.3788 76.3 
Tues. before holidays 59 0.0103 0.7866 0.10 0.9198 0.1192 57.6 
Weds. before holidays 58 0.1600 1.0388 1.17 0.2456 0.1955 69.0 
Thurs. before holidays 60 0.1868 ** 0.7860 2.82 0.0064 0.2986 73.3 
Fri. before holidays n 0.3821 *** 0.7944 4.05 0.0001 0.3411 74.6 
Sat. before holidays 101 0.2758 *** 0.6242 4.44 0.0001 0.2468 64.4 
1 256 0.2097 *** 0.7324 4.58 0.0001 0.1574 62.5 
2: 1703 0.0310 0.9401 1.36 0.1731 0.0522 53.4 
3 18 0.2648 0.8151 1.38 0.1859 0.4115 77.8 
4 56 0.1895 * 0.5595 2.53 0.0141 0.2139 66.1 


1 = Fridays before a no-trading Saturday; 2 = Fridays before a trading Saturday; 3 = Fridays 
before no-trading Saturday and no-trading Monday; 4 = Saturdays before no-trading Monday 
(Saturdays before a Monday holiday). 


Source: Yamaichi Research Institute. 


Table 15 shows the effect of the pre-holidays as separate from the day-of- 
the-week effects. The regression estimates show that the pre-holiday effect is 
the largest, about 0.246% per day. Mondays and Tuesdays are negative as 
expected. Since these estimates use the entire 39 years of the NSA, Wednes- 
days’ returns are slightly less than Saturdays’ and these two days return the 
bulk of the week’s return. All the coefficients are significant at the 5% level. 


Table 15 
Day of the week and pre-holiday effects on the NSA, 1949-1988. 
Effect Mean Standard t-Statistic p-Statistic 
return, % deviation, 
% 
Monday —0.0705 ** 0.0214 — 3.29 0.0010 
Tuesday ~ 0.0435 * 0.0213 — 2.04 0.0412 
Wednesday 0.1151 *** 0.0213 5.40 0.0001 
Thursday 0.0807 *** 0.0213 3.79 0.0002 
Friday 0.0420 * 0.0214 1.97 0.0492 
Saturday 0.1327 *** 0.0229 5.81 0.0001 
Pre-holiday 0.2461 *** 0.0460 5.35 0.0001 


Source: Yamaichi Research Institute. 
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Table 16 
Mean daily return, standard deviation on pre-holidays, NSA, 1949-1988. 
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Holiday Advances Mean Standard Minimum Maximum (Statistic 
return, % deviation, return, % return, % 
% 
January 1 32/39 0.2457 0.7625 — 3.38 1.269 2.01 
January 15 32/39 0.4711 ** 0.7157 — 2.04 1.947 4.11 
February 11 15/22 0.3011 * 0.5326 — 0.42 1.810 2.65 
February 20 21/39 0.0687 0.6213 — 1,50 1.912 0.69 
April 29 25/39 0.2272 0.8313 — 2,96 2.731 171 
May 3 31/39 0.4989 *** 0.6700 —0.74 3.063 4.65 
May 5 30/39 0.3884 *** 0.6184 — 0.66 2.188 3.92 
September 15 14/22 0.0154 0.7309 — 2.48 0.670 0.10 
September 23 27/39 0.2341 ** 0.4866 —0.92 1.310 3.00 
October 10 12/22 — 0.2487 1.4389 — 5.41 1.256 ~0.81 
November 3 24/39 0.2987 * 0.8684 -1.27 2.770 2.15 
November 23 25/39 0.2414 0.9711 —2,18 2.585 1.55 
Source: Yamaichi Research Institute. 
Table 17 
Mean return per day around the time of the golden week NSA, 1949-1988. 
Date Sample Mean t-Statistic p-Statistic 
size return, % 
April 20 32 — 0.1285 —0.73 0.4691 
April 21 32 — 0.0234 —0.18 0.8617 
April 22 33 0.0576 0.34 0.6963 
April 23 34 — 0.0024 —0.01 0.9888 
April 24 33 0.1133 1.23 0.2291 
April 25 34 0.1339 0.92 0.3664 
April 26 33 0.2922 * 2.10 0.0433 
April 27 33 0.1277 0.74 0.4639 
April 28 34 0.2148 1.41 0.1685 
April 30 31 — 0.1418 — 0.47 0.6448 
May 01 33 0.4312 * 2.69 0.0113 
May 02 34 0.5359 *** 4.47 0.0001 
May 04 30 0.3255 ** 2.96 0.0060 
May 06 31 — 0.0976 — 0.60 0.5516 
May 07 34 0.0903 0.87 0.3892 
May 08 33 — 0.1326 -0.79 0.4260 
May 09 33 — 0.1174 —0.91 0.3671 
May 10 32 — 0.0350 -0.23 0.8170 
May 11 32 0.0282 0.28 0.7924 
May 12 33 — 0.0701 — 0.70 0.4905 
May 13 33 — 0.1336 — 1.02 0.3134 
May 14 33 — 0.2421 -1.75 0.0889 
May 15 31 — 0.2111 -1.70 0.0989 
May 16 32 — 0.0635 —0.50 0.6235 
May 17 32 — 0.2209 -1.39 0.1755 
May 18 33 — 0.3210 * — 2.20 0.0355 
May 19 34 — 0.2898 ~1.47 0.1509 
May 20 32 0.0081 0.06 0.9551 
May 21 32 0.3553 1.57 0.1274 
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Table 18 
Rates of return for the 39 turn-of-the years, NSA, 1949-1988, 
Trading Mean Standard t-Statistic p-Statistic 
day return, % deviation, 
% 
-15 0.0292 0.7566 0.24 0.8108 
-14 — 0,603 0.9834 —0.38 0.7037 
-13 — 0.2631 1.2660 -1.29 0.2022 
-12 0.2210 1.9265 0.72 0.4782 
-11 —0.1036 0.7214 — 0.90 0.3754 
-10 0.0202 1.2598 0.09 0.9209 
-9 —0.1197 1.3817 —0.54 0.5918 
-8 0.0347 1.2604 0.17 0.8643 
-7 0.1413 1.1679 0.76 0.4538 
~6 0.1738 0.8303 1.31 0.1990 
-5 0.1303 0.7532 1.08 0.2868 
-4 0.4683 *** 0.6526 4.48 0.0001 
-3 0.4413 * 1.1038 2.49 0.0170 
-2 0.0613 0.9883 0.39 0.7005 
-1 0.2457 0.7625 2.01 0.0513 
1 0.0369 1.0594 0.22 0.8289 
2 0.1539 1.0529 0.91 0.3670 
3 0.3812 1.2409 1.92 0.0626 
4 0.2460 1.0516 1.46 0.1523 
5 0.4067 * 0.9277 2.74 0.0093 
6 0.0856 0.7856 0.68 0.5005 
7 0.2546 1.0541 1.51 0.1397 
8 0.0985 0.8342 0.74 0.4655 
9 0.4007 ** 0.8602 2.91 0.0060 
10 0.2736 * 0.6824 2.50 0.0167 
11 0.2243 0.7166 1.95 0.0580 
12 0.0375 0.7256 0.32 0.7489 
13 0.1101 0.7036 0.98 0.3309 
14 0.2180 * 0.6674 2.04 0.0484 
15 ~ 0.0334 1.4171 —0.14 0.8837 


The standard deviations are coincidently extremely close for all days of the 
week except for the pre-holidays. 

Are some pre-holidays better than others? Table 16 investigates this for the 
twelve holidays. Most of the pre-holidays have high returns. The exceptions 
are September 15 which is just barely positive and October 10 which is 
negative. This is not surprising as the earlier research showed very poor 
returns in these months over the years. 


6. The golden week effect 


In late April and early May there are three holidays in a one-week period 
referred to as The Golden Week. The holidays are on April 29, May 3 and May 
5. Hence, one would expect that April 28, May 2 and May 4 would have 
strong returns. Also, in relation to the Christmas holiday period, as Lakonishok 
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and Smidt (1989) found for the DJIA in the U.S., other days in this period 
likely will have large returns as well. The results for the NSA 1949-1988 data 
displayed in table 17 shows that the three pre-holidays have high returns. April 
28 returns, on average, 0.215%, May 2 a whopping 0.536% and May 4 0.326%. 
The in-between day, May 1, a +1 day, and a —2 pre-holiday, returns an 
extremely high 0.431%. The post-holiday day is poor; not surprisingly it 
returns a negative 0.142%. It is an exception to the observed fact that the —1 
day of the month is extremely positive. A change in 1989 may affect this. This 
is the closing of the market on May 4 to make a three-day break. With only 
two instead of three pre-holidays the total return in this period may be 
sufficient for —1 to be strong as usual. This was the case in 1989 and 1990. 


7. Turn-of-the-year effect 


Based on our discussion of the turn-of-the-month effect, one would expect 
the turn-of-the-year effect to begin around day —5 and run to the middle of 
January. The data shown in table 18 indicate that the effect seems to start on 
day —7 when the mean return is 0.142% per day. The effect then has positive 
returns on every trading day until +14. Many of the days have phenomenal 
average returns: —4 is 0.468%, —3 is 0.441%, —1 is 0.246%, +3 is 0.381%, 
+5 is 0.407%, +7 is 0.255%, +9 is 0.401%, +10 is 0.274%, +11 is 0.224%, 
and +14 is 0.218%. This is with the 39 years of NSA data from 1949-1988. 
There are no small-firm indices readily available in Japan. Given the results 
discussed earlier that the small firms would probably have even higher returns. 
The cumulative average return on these 21 trading days is about 3% for the 
NSA. 

The —1 day, the final trading day of the year, is the start of the U.S. 
turn-of-the-month effect, how important is it in Japan? Its return, by itself, is 
about 0.198% per day, or more than four times the average day’s return of 
about 0.0476% as shown by the regression model: 


Mean return % t-Statistic p-Statistic 
bo Typical day 0.0476 *** 5.47 0.0001 
bi —I's 0.1981 1.32 0.1856 


where 


l 1 for — 1 day of the month 
b= ‘ 
0 otherwise 


The following regression model separates out the —1 of the year effect, 
from the —1 of each other month and the pre-holiday effect. The results show 
that each of these three effects is quite similar in size, returning about 0.2% per 
day. At least one third of the total mean returns amounts to ~—1’s and 
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pre-holidays which is similar to what Lakonishok and Smidt (1989) found for 
the Dow Jones Industrial Average over the 90 years from 1897 to 1986. 


Mean return t-Statistic p-Statistic 
bo Typical day 0.0341 *** 3.78 0.0002 
bi —1 of the year 0.2116 1.42 0.1568 
by — 1 of the month 0.1822 *** 3.95 0.001 
by Pre-holidays 0.2121 *** 4.30 0.0001 


b = p for last trading day of December 
0 otherwise 


, 


and pre-holidays ' 


1 for-1 day of the month except for January 
bz 
0 otherwise 


| 1 pre-holidays except-1 day of year 
0 otherwise 


8. Conclusion 


The results show that the seasonality regularities in Japan during the 
1949-88 period were quite similar to the corresponding effects in U.S. security 
markets. Differences occur because of alternative institutional and cultural 
patterns in Japan. For example, the turn-of-the-month effect which indicates 
higher returns on trading days —1 to +4 in the U.S. is paralleled in Japan 
with days —5 to +2. The returns on these days were about two thirds of the 
total monthly return. The rest of the return in each country was earned on 
days +5 to +9 in the U.S. and +3 to +7 in Japan. For the balance of the 
month in each country, the returns are statistically Jess than or equal to zero. 
The turn-of-the-year effect was also similar to that in the U.S. except it was 
longer in December and in January. 

The holiday effect is also similar with strong gains on pre-holiday trading 
days and negative returns on the post-holiday trading days. The Golden Week 
effect in early May is unique to Japan. It is interconnected with the holiday 
effect since during the sample study period 1949-88 there were three holidays 
during Golden Week. The strongest days of the year in Japan as in the U.S. 
are the pre-holidays and the —1 days for all twelve months. These days have a 
mean return of about 0.20% per day. 

The small stock January effect in Japan was similar to that in the U.S. 
despite the fact that there were no capital gains for individuals during the 
period. Japan also has a strong June effect for small stocks. Large bonuses 
paid to workers just prior to these stock rises seem to be at least a part of the 
cause of the high returns in these months. 

Students and practitioners of the Japanese stock market can utilize the 
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seasonal regularities for several purposes. One is in the construction of 
seasonality calendars that rank the days of the years on a +4 (the best) to — 4 
(the worst). This is discussed in Ziemba and Schwartz (1991). The evidence is 
that such seasonality calendars do show violations of market efficiency with 
the high rated day having significantly higher returns than the low rated days 
at high levels of significance. The seasonal regularities along with the small 
stock effect can also be used to construct anomaly portfolios which is also 
discussed in Ziemba and Schwartz (1991). 
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Chapter 12 


SEASONALITY EFFECTS IN JAPANESE FUTURES MARKETS* 


William T. Ziemba 


Yamaichi Research Institute, Tokyo 
University of Tsukuba and 
University of British Columbia 


This paper investigates seasonal regularities in the security price 
returns on the first section of the Tokyo Stock Exchange. The research 
uses data from the futures markets in Singapore for the Nikkei Dow 
225 index and in Osaka for the Kabusaki 50 index. The questions of 
main concern are whether or not the seasonal anomalies observed in 
the spot markets are maintained, are they anticipated in the futures 
markets, and do the futures market anticipations alter the character of 
the seasonal regularity. Results are presented concerning day of the 
week, monthly, holiday, turn of the month and year and first half of 
the month effects. The conclusions are tentative because the futures 
markets have only a short history to date. 


1. INTRODUCTION 


The Japanese country and economy were devastated in WWII. After the war 
the country was occupied and plans were set for a rebuilding. The stock 
markets reopened in 1949. The early years were difficult but the hard work 
and internal savings for investment paid off as the country's products gained 
more and more acceptance abroad. The full fruits of the economic miracle of 
Japan are now well known and feared in the west. Since 1980 the transfer of 
wealth to Japan from the U.S and others has been very large indeed. Figure 1 
shows this through the world stock market capitalizations in October 1980 and 
September 1988. Europe's share of the world's stock markets has stayed about 
constant (21% in 1988 versus 20% in 1980), the U.S. share has dropped from 


*Without implicating them I would like to thank my colleagues at the Yamaichi Research 
Institute particularly A. Komatsu and H. Shintani for their help and useful discussions on 
anomalous behavior in Japanese security markets. Thanks are also due to Warren Bailey and 
Sandra Schwartz for helpful comments on an earlier draft of this paper. This research was 
conducted at the Yamaichi Research Institute with the advice of William T. Ziemba. All the 
rights to the research belong to the Yamaichi Research Institute. I thank the Yamaichi 
Research Institute for permission to publish the results. This research was also partially 
supported by the Social Sciences and Humanities Research Council of Canada, and the Centre 
for International Business Studies, University of British Columbia. 
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53% to 31% and the rest of the world's is now 6% compared to 12% in 1980. 
Meanwhile, Japan's share has increased from 15% to 42% Net assets of Japan 
were $11.5 billion at the end of 1980 and those of the U.S. were $106.3 billion. 
By 1987 Japan's net assets grew over 20 fold to $240.7 billion. Meanwhile U.S. 
private and public assets fell by a negative $368.2 billion. There is much talk in 
the press about the U.S. trade and budget deficits. Indeed the public's deficit 
was $148.9 billion at the end of 1987. But the private sector of the U.S. is also 
$219.3 billion in the red. 


Rest 


Japan 


Rest 


Europe 


Japan 
Europe 


U.S. 


October 1980 September 1988 


Figure 1: Stock Market Capitalizations 


There has been a huge increase in Japanese stock prices. A standard measure 
of the market is the Nikkei Dow index. This index is analogous to the Dow 
Jones Industrial Average in the U.S. Its value at any time is the sum of the 
prices of the 225 stocks in the index divided by the current divisor which is 
adjusted over time to account for stock splits, rights offerings, etc. The index 
stood at ¥176.21 on the date of its original investment, May 16, 1949. By the 
end of the year the index had fallen to ¥109.9. But by the end of December 1988 
the index had increased to ¥30,159. The divisor which began at 225 was then 
10.289. The increase over the 38+ years was 171.15 times not counting 
dividends and taxes. In U.S. dollars the increase was a remarkable 489.59 
times. The increase has not been straight up. Over this period there have 
been twenty corrections of 10% or more and nine of over 20%. For details see 
Ziemba and Schwartz (1990). Table 1 shows the closing prices of the ND index 
plus the value of the yen/dollar exchange rate year by year plus the gains in 
yen and dollars for the ND index. 


The major stock market in Japan is the Tokyo Stock Exchange. The TSE has 
over 85% of the value and trading volume of the whole market which is also 
traded in Osaka, Nagoya and five other smaller exchanges. The stocks in the 
Nikkei Dow comprise about 50% of the market value and about 75% of the 
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Table 1: Yearly Yen Closing Prices of the Nikkei Dow and 
Yen/Dollar Exchange Rates, 1949-88 
Current Value of 
1¥ invested 1$ invested 


End of Year ND Yen/$ in 1949 in¥ in 1949 in$ 
1949 109.9 360.00 0.62 0.62 
1950 101.9 360.00 0.58 0.58 
1951 166.1 360.00 0.94 0.94 
1952 362.6 360.00 2.06 2.06 
1953 377.9 360.00 2.14 2.14 
1954 356.1 360.00 2.02 2.02 
1955 425.7 360.00 2.42 2.42 
1956 549.1 360.00 3.12 3.12 
1957 474.5 360.00 2.69 2.69 
1958 666.5 360.00 3.78 3.78 
1959 874.9 360.00 4.97 4.97 
1960 1356.7 360.00 7.70 7.70 
1961 1432.6 360.00 8.13 8.13 
1962 1420.4 360.00 8.06 8.06 
1963 1225.1 360.00 6.95 6.95 
1964 1216.5 360.00 6.90 6.90 
1965 1417.8 360.00 8.05 8.05 
1966 1452.1 360.00 8.24 8.24 
1967 1283.5 360.00 7.28 7.28 
1968 1714.9 360.00 9.73 9.73 
1969 2359.0 360.00 13.39 13.39 
1970 1987.1 360.00 11.28 11.28 
1971 2713.7 314.80 15.40 17.61 
1972 5207.9 302.00 29.56 35.23 
1973 4306.8 280.00 24.44 31.42 
1974 3817.2 300.95 21.66 25.91 
1975 4358.6 305.15 24.74 29.18 
1976 4990.8 292.80 28.32 34.82 
1977 4865.6 240.00 27.61 41.42 
1978 6001.8 194.60 34.06 63.01 
1979 6569.5 239.70 37.28 55.99 
1980 7116.4 203.00 40.39 71.62 
1981 7681.8 219.90 43.59 71.37 
1982 8016.7 235.00 45.50 69.69 
1983 9893.8 232.20 56.15 87.05 
1984 11542.6 251.10 65.50 93.91 
1985 13113.3 200.50 74.42 133.62 
1986 18701.3 160.05 106.13 238.72 
1987 21564.0 123.00 122.38 358.18 
1988 30159.0 125.85 171.15 489.59 

April 19. 1989 33185.15 132.36 188.32 512.22 


trading volume on the first section of the TSE. The second section has only a 
few percent of the value and volume so the first section and the Nikkei Dow 
give onea good indication of the whole market. The first section's 
capitalization is currently about Y490 trillion which is more than all the stock 
exchanges in the U.S. The ND index suffers from the same problems as the 
DJIA. A more representative index of the whole market is the TOPIX which is 
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a value weighted index of all, some 1135, stocks on the first section. It is 
analogus to the NYFE index in the U.S. Relative to the TOPIX, the ND has a 
beta (using data from Octover 1983 to October 1988) of about 0.84 so it is less 
variable than the overall market. The TSE is highly regulated with detailed 
and cumbersome listing procedures, price and margin limits and the like. 
Trading is dominated by the institutions including the life insurance 
companies and made largely through the big four: Nomura, Nikko, Daiwa 
and Yamaichi. Although there are some 45 foreign brokerage firms in Japan 
they handle only about 2% of the trading volume. In contrast, Nomura alone 
has about 30% of the trading volume. Although the foreign firms are very 
active in program trading and the like the lack of commission business has led 
to massive losses. Only ten of the 45 had profits in 1988 and the profits of 
these were small compared to those of the big four.! 


The Japanese stock market is not well understood and is therefore avoided in 
the U.S. Despite sharply rising prices and a firming yen, there was net selling 
by foreigners from 1984 to mid 1988. Fears of extraordinary high PE ratios, 
land prices beyond belief and the like abound as has been described in Ziemba 
and Schwartz (1990). A few key points are: 


¢ there are extraordinary high correlations between land prices particularly of 
commercial land and stock prices, see also Ziemba (1989b) 


+ much of the stock holdings are to cement business relationships and are 
never traded 


e there is a great preference by the Japanese corporations and individuals to 
accumulate land and a strong feeling not to sell it ever 


e the value of the stocks is much less, even with their high PE ratios based 
on trailing earnings of some 50-70 times, than the value of the land and 
stock holdings they own not to mention the value of the assets in 
businesses they are in. Tobin's Q, the percent of the assets that the stocks 
are worth is about 70% for the ND. 


> the PE ratios are not generally as useful a measure of stock value as in the 
U.S. because of the nontrading, undervaluation of land and other 
buildings, stock holding and other assets. Still there is a small but 
significant low PE effect separate from other valuation descriptors 


The impending crash that many U.S. and other foreign observers see as 
inevitable may take a while to materialize if at all. The confidence level of the 
Japanese in their markets is high and for good reason. Sharply rising earnings 
growth, GNP growth of about 6% in 1988, 4.5% in 1989 (estimated), low 


Japanese firms are not doing well in the U.S either. 
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interest and inflation rates, a strong yen, huge balance of trade and the like are 
cited as reasons (and used as explanatory variables in econometric prediction 
equations) for this strength. Japanese investors prefer to buy on weakness 
rather than on strength. That is the reverse of the majority of U.S. investors. 


The stock market is opening up quickly with a whole host of derivative 
instruments recently added and planned. There are now four index futures 
contracts trading and options on futures indices are scheduled to be traded 
later in 1989. Details on these contracts and their early histories appear in 
Ziemba and Schwartz (1990). 


There is not much literature yet on Japanese anomalies.3 Part of the reason for 
this is a lack of interest in such studies by the big Japanese brokerage firms who 
are unaware and suspicious of their potential use. Also these firms do not 
generally publish much of their research findings even in Japanese. Recently, 
however, U.S. and Japanese researchers have been studying these markets. 
The thrust has been mainly to ascertain the similarities and differences with 
the analogous results in U.S. markets. Some of the main references are 
Hawawani (1988a,b), Jaffe and Westerfield (1985a,b), Kato (1988a,b), Kato and 
Schallheim (1985), Kunimura (1984), Ikeda (1985, 1988) and Nakumura and 
Terada (1984). Kato, Schwartz, and Ziemba (1989) have surveyed the research 
and presented new results on day of the week effects in Japanese security 
markets. Other research works on the Japanese stockmarket are Bailey (1989), 
Brenner, Subrahmanyam and Uno (1987, 1988), Elton and Gruber (1988, 1989), 
Hamao (1988, 1989), Kunimura (1984), Hiraki, Aggarwal and Rao (1988), 
Komatsu and Ziemba (1989), Lau, Quay and Ramsey (1974), Roehl (1985), 
Pettway and Tapley (1984), Suzuki (1988) and Schoenfeld (1988). 


Ziemba (1989a) had access to data on the entire 38+ years of the ND and TOPIX 
indices and used it to study holiday, monthly, turn of the month and year, first 
half of the month and Golden Week effects on the first section of the TSE. 
There are strong and pervasive security market regularities that with some 
changes because of the institutions involved are analogous to many U.S. find- 
ings. The purpose of this paper is to investigate how the new index futures 
markets are affecting these anomalies. The specific data used are from two of 
these futures contracts: the SIMEX, ND225 futures traded on the Singapore 
International Monetary Exchange and the Kabusaki 50 index traded on the 
Osaka Stock Exchange. These contracts began trading on September 3, 1986 and 
June 6, 1987, respectively. The newest futures contracts, based on the TOPIX 
traded on the TSE, and the ND traded on the OSE began trading on September 


2For a survey of the U.S. derivative instruments and some of the theory behind them see 
Rubinstein (1987). 
3The U.S. literature on seasonal regularities is now extremely voluminous. Surveys and list of 


references appear in Schwert (1983), Keim (1986), Jacobs and Levy (1988) and the books by 
Dimson (1988) and Ziemba (1990). 
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3, 1988. In this paper we investigate the SIMEX, and Kabusaki data up to 
September 19 and 20, respectively, which is 557 trading days for the SIMEX and 
352 for the Kabusaki. 


Figure 2 shows the volume on the SIMEX and Kabusaki 50 during and after 
the period of this study. Similarly, Figure 3 shows the volume of the two new 
contracts. The main point is that the new contracts essentially replaced the 
Kabusaki 50 and its trading volume shrank to almost nothing and it has 
remained at this low level of activity. Meanwhile the SIMEX had an increase 
in volume part of which is increased programmed trading. The papers by 
Brenner, Subrahmanyam and Uno (1987, 1988) study arbitrage programmed 
trading strategies between the SIMEX and Kabusaki for the early part of the 
history before September 1988. See also Chapter 13 in Ziemba and Schwartz 
(1990). The Kabusaki is a price weighted package of 50 stocks traded on the TSE 
and OSE designed to track the ND. It has a daily limit price move of 3% in 
either direction. Hence in a market crash, it does not trade. The SIMEX has no 
such price limits and had a huge fall on crash day October 20, 1987. It now has 
a daily price limit change of 15% 


Singapore time is one hour behind Tokyo time, hence to correspond to 
Tokyo's 9:00 a.m. opening, the SIMEX begins trading at 8 a.m. During the time 
of this study, the TSE was open for trading from 9:00-11:00 a.m. and 1:00-3:00 
p.m., and 9:00-11:00 a.m. on the first, fourth and fifth (if there is one) Saturdays 
of the month. As of the beginning of February 1989 there was no Saturday 
trading on the TSE with trading in the afternoon from 12:30-3:00 pm with 
extra trading on the previously closed days December 28 and 29. SIMEX does 
not close for lunch and trades continuously from 8 a.m. to 2:15, an extra fifteen 
minutes after Tokyo's close. This is much the same as the extra time futures 
on the S&P 500 and other indices trade following the close of the New York 
Stock Exchange. Most of the trading days on the TSE and SIMEX are 
comparable but occasionally one of them has a different holiday. In the initial 
period until May 23, 1987 SIMEX did not trade on Saturdays. 


The SIMEX has a spot contract for the ND that is rarely traded, plus four 
futures contracts that mature on the quarter months, as do the Kabusaki and 
U.S. futures contracts, of March, June, September and December. The contracts 
mature on the third Wednesday of the month and they are settled in cash 
based on the closing value of the ND. The Kabusaki is settled on the fifteenth 
day of each contract month with the delivery of the basket of stocks. 
Throughout most of its trading history the SIMEX traded at a discount from 
fair value and often the Kabusaki was at a premium. There simply was not 
enough arbitrage trading to keep the prices in line. I refer the reader to the 
cited references and also Schoenfeld (1988) who among other 
accomplishments was the first westerner to trade on the SIMEX. 
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Figure 3: Trading Volume of the ND and TOPIX Futures Contracts Traded on 


the OSE and TSE, September 3, 1988 to January 31, 1989. 
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The plan of the paper is to describe in turn each of the spot anomalies and 
then investigate their current effects in the futures and spot markets. The 
conclusions reached are tentative because these data sets are still small but 
they are suggestive. In general, the anomalies still seem to be there, although 
they are anticipated in the markets to some extent and this has shifted their 
effects somewhat. 


2. DAYS OF THE WEEK EFFECT 


A survey of the days of the week effect in the spot Japanese stock markets 
along with some new results investigating the ND and TOPIX indices for the 
entire 1949-88 period appears in Kato, Schwartz and Ziemba (1989). The major 
research papers investigating the day of the week effects using data from the 
1970s and 1980s are Ikeda (1985, 1988) in Japanese, Kato (1988a,b), Kato and 
Schallheim (1988), and Jaffe and Westerfield (1985a,b). Some of the main 
conclusions of this research are: 


° There is a strong day of the week effect. Wednesdays and Saturdays are 
strongly positive, Thursdays and Fridays are mildly positive, Mondays are 
about neutral, and Tuesdays are especially negative. Until the end of 
January 1989, Saturday trading occurs for a half day on the first, fourth, and 
(if applicable) the fifth week of the month. 


e For the period 1978-87, Wednesdays gain 0.145% and Saturdays 0.14% each 
about triple the average gain of 0.058%. This is counterbalanced by the 
nearly even Mondays which return 0.004% on average and a loss of 0.09% 
on Tuesdays. Thursdays and Fridays return 0.065% and 0.105%, 
respectively. 


° Except for Saturdays which open up and rise all day, on average, most of 
the gains occur at night. Except for a brief rise early in the day and a little 
kick at the end all the other days have negative returns during trading 
hours. 


e Saturday trading affects the market in significant ways. Indeed the market 
seems affected by whether or not Saturday trading occurs. For example, 
with Saturday trading on the first, fourth and fifth weeks of the month 
during the sample period, the following Mondays are positive. But 
without Saturday trading Mondays are negative. Tuesdays are always 
negative except in the third week of the month when there is no Saturday 
trading both in this week and the proceeding week. In this case Mondays 
are especially negative losing 0.35% on average while Tuesdays gain back 
0.19% of this. 


e In total, Monday-Tuesday trading losses in Japan are about 0.09% which is 
similar to the Monday losses coupled with Tuesday's mildly positive gain 
in New York for a Monday-Tuesday return of about -.11%. Correlations 
between the New York and Tokyo's Dow's vary by time period but these 
relationships are more or less becoming stronger over time. What is 
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consistent is that New York's effect on Tokyo is stronger than the reverse. 
For the period 1980-87 the correlation between the previous day's return in 
New York and the close-to-open returns in Tokyo was 53.6%. The reverse 
effect, the correlation between Tokyo's close-to-close and New York's close- 
to-open is only about 11%. 


Table 2 sumarizes the recent effects using 1978-87 data and the effects of 
Saturday trading. 


Table 2: Effects of Saturday Trading on the TOPIX, April 4, 1978-June 18, 1987 


Trading Ends 
This Last Sample 
Week Week Week Mon Tues Wed Thur Fri Sat Size 


Ist,5th A1 Sat Sat 0.0198 -0.1072 0.1790 0.0581 0.0605 0.1678 1406 
4th A2 Sat Fri -0.1008 -0.1102 0.1074 0.0306 0.1582 0.0793 649 


2nd B1 Fri Sat 0.1135 -0.0649 0.0980 0.0933 0.1341 539 

3rd B2 Fri Fri -0.3489 0.1931 0.2479 0.2609 0.2193 58 
All Weeks 0.0039 -0.0902 0.1449 0.0648 0.1049 0.1397 2652 

Sample Size 449 464 464 465 467 343 2652 


(Source: Kato 1988a) 


Pieptea and Prisman (1988) and others such as Cornell (1985) have found that 
in the U.S. markets Mondays fall is fully anticipated in the futures markets 
using the S&P500 index. The futures market tends to fall in the last hour or so 
of trading so that even as the spot market is rising the futures are moving 
lower to anticipate Monday's decline. For the period March 18, 1983 to June 
27, 1986 Pieptea and Prisman found the following results: 


Mon Tues Wed Thur Fri 
Mean Annualized Return 
___in Spot Market : -4.93 42.33 3.16 23.50 32.59 
iie Standard Deviation 211 245 241 197 223 


Mean Annualized Return 
in Futures Market of the 


i pera n 19.35 9.10 483 23.57 6.75 
Me eee ee 2.09 2.46 2.48 2.07 2.12 


Figure 4 and Table 3 compare the day of the week effects on the spot ND in 
Tokyo with the futures market on the Nikkei Dow at the SIMEX in Singapore. 
Similarly Figure 5 and Table 4 compare the effects on the Osaka 50 futures, the 
Osaka 50 spot and the Nikkei Dow spot. 
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Table 3: Days of the Week Effects in the ND Spot Market in Tokyo and the 
SIMEX Futures Markets in Singapore, Sept 3, 1986 to Sept 19, 1988 


SIMEX futures ND spot 

Sample Mean % Sample Mean % 
Day Size Return t-Statistic Positive Size Retum t-Statistic 
Positive 
All 532 0.094 100 56.8 557 0.082 1.50 56.6 
Mon 102 -0.223 -1.52 46.1 100  -0.301* -2.93 44.0 
Tues 104 -0.137 -0.47 57.7 99 -0.120 -0.68 50.5 
Wed 101 0.029 0.20 58.4 100 0.251 1.74 60.0 
Thur 101 0.534 1.90 59.4 101 0.305* 3.08 64.4 
Fri 99 0.207 1.39 62.6 102 0.174 1.54 59.8 
Sat 25 0.383 1.09 56.0 55 0.258* 1.96 63.6 


In this and succeeding tables, * indicates that a coefficient is significantly 
different from zero at the 5% level, ** at 1% and *** at 0.1%. 


Figure 4: Days of the Week Effects in the ND Spot Market in Tokyo and the 
SIMEX Futures Markets in Singapore, Sept 3, 1986 to Sept 19, 1988 


Simex Futures 
HA ND Spot 
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Table 4: Days of the Week Effects in the ND Spot Market and the Osaka 50 
Spot Market in Tokyo and the Osaka 50 Kabusaki Futures in Osaka, 
June 6, 1987 to Sept 20, 1988 


Osaka 50 Futures Osaka 50 Spot 
Sample Mean % Sample Mean 
Day Size Return t-StatisticPositive Size Return t-Statistic % 
Positive 
All 352 -0.020 -0.30 49.4 353 0.000 0.00 52.1 
Mon 65 -0.329* -2.35 38.5 65 -0.347* -2.50 38.5 
Tues 62 0.205 1.20 54.8 63 -0.173 -0.62 60.3 
Wed 64 0.018 0.10 51.6 64 0.275 1.15 53.1 
Thur 63 -0.012 -0.09 50.8 63 0.173 1.31 55.6 
Fri 64 0.007 0.04 46.9 64 -0.025 -0.15 46.9 
Sat 34 0.018 0.08 58.8 34 0.190 0.90 64.7 
ND Spot 
Sample Mean % 
Day Size Return% t-Statistic 
Positive 
All 353 0.033 0.45 56.4 
Mon 65 -0.254 -1.93 46.1 
Tues 63 -0.196 -0.75 52.4 
Wed 64 0.318 1.57 59.4 
Thur 63 0.204 1.79 63.5 
Fri 64 0.059 0.39 56.3 
Sat 34 0.107 0.62 64.7 


Figure 5: Days of the Week Effects in the ND Spot Market and the Osaka 50 
Spot Market in Tokyo and the Osaka 50 Kabusaki Futures in Osaka, 
June 6, 1987 to Sept 20, 1988 
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The spot effect of weak Mondays and Tuesdays and strong Wednesday to 
Saturday is continuing. Wednesdays are the strongest days followed by 
Thursdays and Saturdays in the spot market. The Simex futures in Singapore 
look just like the spot market in Tokyo except in the Wednesday -Thursday 
period. Possibly this market is following rather than leading, although other 
evidence suggests the leading hypothesis. The futures seem to rise and fall 


Chapter 12: Seasonality Effects in Japanese Futures Markets 


303 


Seasonality Effects in Japanese Futures Markets 391 


with the spot. The Osaka 50 seems to anticipate the days of the week effect. 
Saturdays hardly rise at all in the futures markets despite the spot gains thus 
anticipating Monday's fall. Then these futures fall on Monday to anticipate a 
further fall on Tuesday. They then rise on Tuesday to anticipate the gains on 
Wednesday, Thursday, Friday and Saturday. They are then flat until the fall 
the next Monday. 


3. HOLIDAY EFFECTS 


The main references on the holiday effect in U.S. spot markets are Ariel (1988), 
Lakonishok and Smidt (1989), Zweig (1986) and the survey in Ziemba (1990). 
These authors found a very strong effect: days before holidays have 
significantly higher returns than other days. The effect is relatively strongest 
for large capitalized stocks although the total preholiday returns is higher for 
the small stocks because of the small firm effect. Indeed for the 90 year 
Lakonishok and Smidt DJIA sample from 1897-1986 these eight or so days 
return more than half the non-dividend return over the whole year. Trading 
days before the preholiday and after the holiday do not have significantly 
higher returns. Indeed these days particularly the post holidays have lower 
returns on average. Ziemba (1989a) has investigated similar effects on the 
TSE. In Japan there are about twelve holidays each year compared with about 
eight in the U.S. For the Nikkei Dow from 1949-88, I found that the mean 
daily returns could be estimated by 


R = 0.0352 + 0.0799 Day_5 + 0.0222 Day.) + 0.1894Day_, - 0.0663 Day ,; + 0.00114 Day 


+2, 
(3.745) (1.491) (0.424) (3.709) (-1.334) (0.023) 


where Day-3 is the dummy for the separate effect of trading day -3, etc. The 
pre-holiday with an extra mean return of 0.1894% per day is statistically 
significant with a t-static of 3.709 and none of the other days have returns that 
are significantly different from bg. The effect of the +1 day is negative, 
although the coefficient is not significant. Their total returns are slightly 
positive, 0.0068% which is less than a typical day. In total, the preholidays 
return 0.2246 per day versus 0.049% on a typical non-preholiday with lower 
risk measured by the standard deviation, 0.794% versus 0.979%. The 
preholidays also improve every day of the week, for the Japanese stocks see 
Ziemba (1989a), just as Lakonishok and Smidt (1989) found for the DJIA. 


Figure 6 and Table 5 show the spot holiday effect on the ND and on the SIMEX 
futures market for the period of the SIMEX data, September 3, 1986 to 
September 19, 1988 some 557 trading days. The preholidays have gains on 78% 
of the days which is much higher than for the other days. The mean returns 
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are high, averaging 0.276% per day which is nearly four times the return on 
other days. However, the t statistic of 1.57 is not significant except at the 13% 
level. Moreover, the third day before the holiday has even higher returns. 
The other days around the preholiday have low returns as expected. 


In the futures markets the SIMEX anticipates the holiday effect on the 3rd day 
before the holiday. This anticipation seems to affect the spot market so the 
spot prices rise on -3 taking out some of the gains that would normally occur 
on the preholiday. The return in the SIMEX futures market on the preholiday 
is in fact slightly negative again reinforcing the fact that the preholiday effect is 
anticipated. 


Table 5: Holiday Effect on SIMEX Sept 1986-Sept 1988 


ND Spot on TSE ND Futures on SIMEX 
Sample Mean % Sample Mean 
Day Size Return% t-statisticPositive Size Returnn% t-statistic % Positive 
All 557 0.082 1.50 56.6 508 0.098 1.00 56.9 
3rd PH 21 0.370 1.38 57.1 17 0.684 1.62 64.7 
2ndPH 22 -0.083 -0.30 50.0 20 0.014 0.04 45.0 
PH 23 0.276 1.57 78.3 18 0.225 0.48 66.7 
AH 23 -0.001 0.00 56.5 21 0.445 1.25 61.9 
2nd AH 21 0.092 0.48 52.4 21 0.320 1.07 66.7 
Others 447 0.071 1.12 55.9 411 0.063 0.55 56.0 


Figure 6: Holiday Effect on SIMEX Sept 1986-Sept 1988 
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Figure 7 and Table 6 investigate the holiday effect from the Kabusaki Osaka 50 
index for the period June 6, 1987 to September 20, 1988. In the spot market on 
the TSE the effect is similar to the historical record. The preholiday returns 
average 0.50% per day and the index increases 75% of the time. The gain is 
significant at the 5% level. The third trading day before the holiday has high 
returns but they are not significant. In the futures markets the return is only 
significant at the 5% level on the preholiday. The returns were high around 
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the holiday periods in this sample with the days outside the pre and post 
holidays having negative returns. There is no evidence of the future market 
anticipating the holiday effect. 


Table 6: Holiday Effect on the Kabusaki 50 Sept 1986-Sept 1988 


Kabusaki 50 Spot on TSE Kabusaki 50 Future on TSE 
Sample Mean % Sample Mean 

Day Size Return t-StatisticPositive Size Return t-Statistic % Positive 
All 353 -0.001 0.00 52.1 352 -0.020 -0.30 49.4 
3rd PH 12 0.340 0.68 58.3 12 0.324 0.61 41.7 
2ndPH 11 0.021 0.04 45.5 11 -0.206 -0.42 45.5 
PH 12 0.499** 3.68 75.0 12 0.391* 2.27 66.7 
AH 12 -0.022 -0.07 50.0 12 0.284 1.03 50.0 
2nd AH 11 0.243 0.74 63.6 11 0.324 0.68 63.6 
Others 295 -0.043 -0.46 50.8 294 -0.070 -0.94 48.6 


In the spot market, the holiday effect for this sample of data of a year plus is 
not working in the usual fashion. The preholiday is strong but not dominant 
compared to the other days around the holiday. In fact for the SIMEX trading 
period the ND spot has risen most on the 3rd day before the holiday not on the 
preholiday. For the period that the Osaka 50 has been trading, the preholiday 
has the highest returns on the ND and these results are highly significant. 
The Osaka 50 also seems to have a strongly positive effect on the -3 preholiday 
trading day although these coefficients are not significant. So there may be 
some anticipation of the effect but with the small sample one cannot conclude 
much. 


Figure 7: Holiday Effect on the Kabusaki 50 Sept 1986-Sept 1988 
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4. TURN OF THE MONTH AND FIRST HALF OF THE MONTH EFFECTS 


The U.S. turn of the month effect occurs on trading days -1 to +4, see Ariel 
(1987) for an analysis of the spot data from 1963-82 and Sick and Ziemba (1989) 
for an analysis of futures data from 1982-88. The gains in this period are very 
high and contribute a very large percentage of the total monthly returns. The 
return in this week, coupled with that in the second week of the month, 
roughly trading days +5 to +8 or +10, essentially amounts to all the gains in 
the spot stock market during 1960-88. The rest of the month is essentially 
noise and at best provides zero returns. The reason(s) for this effect are not 
fully known but part of the story is that people receive their salaries on or 
around the -1 day as well as their stock account statements so they have funds 
to invest in stocks. There are also the portfolio balancing and renewal effects. 
When would one expect the turn-of-the-month to be in Japan? Many 
companies pay their salaries around the 25th of the month. Indeed, Ziemba 
(1989a) found similar turn of the month and first half of the month effects on 
the TSE. These effects are in the trading periods -5 to +2 and +3 to +7. For the 
ND from 1949-88 the period with trading days -5 to +2 receives the bulk of the 
month's return and each of these day's return is significant at the 5% level or 
better. Days +3 to +7 have positive returns and the rest of the month is noise; 
see Table 7 and Figure 8. Because of the pecularities of the Japanese market 
including Saturday trading day, +22 corresponds to a -5 day once a year or for 
39 times in that many years. Since Japan had more trading days because of the 
two or three Saturdays that were trading each month these periods are 
correspondingly lengthened. In the first half of the month the mean return is 
0.01142% per day while in the second half it is an insignifiant 0.00048% per 
day. 
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Table 7: Turn-of-the-Month Data, 1949-1988 


Trading Sample Mean t- Trading Sample Mean t- 
Da Si Return, % tatistic Day Si Return, % Statistic 
-5 471 -0899* 2.75 9 471 -1065* 2.46 
-4 471 -1041* 2.65 10 471 .0620 1.58 
-3 471 .1733"* 3.82 11 471 -0395 1.05 
-2 471 .1334** 3.18 12 471 -.0196 -0.50 
-1 471 -2255*** 5.36 13 471 .0115 0.24 
1 471 -0980* 2.43 14 471 -.0042 -0.10 
2 471 -1006* 2.47 15 471 -.0306 -0.54 
3 471 0307 0.72 16 471 .0716 1.35 
4 471 0592 1.15 17 471 -.0498 -1.20 
5 471 0358 0.81 18 429 -.0207 -0.49 
6 471 -.0005 -0.01 19 350 0162 0.36 
7 471 0357 0.09 20 229 -.0286 -0.48 
8 471 -.0585 -1.34 21 118 -.0476 -0.46 
22 39 .0562 0.45 


trading days 
ist Half 22-7 .01142 
2nd Half 8-21 00048 
Source: Yamaichi Research Institute reported in Ziemba (1989a). 


Figure 8: Turn-of-the-Month Data, 1949-1988 
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The turn of the month effect with high returns on days -5 to +2 seems to still 
be there but the SIMEX and Osaka 50 futures markets totally anticipate the 
effect. On the SIMEX, days -8 to -5 have all the gains of about 2.8% per month 
on average. This is a huge average gain in only four days. This is the whole 
turn of the month effect with the rest of the period about even. Although 
lower than in the futures markets, the spot market also has large gains on days 
-8 to -5. But the spot gains in the -5 to +2 period are still abnormally high. On 
the Osaka 50 futures market the anticipation seems to be on days -7, -6 and -5 
with a total gain of about 1%. Both markets lose back much of the gain on -4 
and -3. The effect in the spot market is similar to that mentioned above for 
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the ND. The SIMEX seems to lead the Osaka 50 futures and the spot ND in 
this effect. Table 8 and Figure 9 describe the results. Since there is only one or 
two turn of the years this data is lumped in with that for the other months. 
These were calculated separately, but this does not seem to change the results 


Table 8: The Turn of the Month Effect on the ND 


much. 
ND Spot on TSE 
Sample Mean % 
Day Size Return t-StatisticPositive 
-7 24 0.432* 2.17 66.7 
-6 24 -0.095 -0.40 62.5 
-5 24 0.333* 1.99 62.5 
-4 24 0.347* 2.00 62.5 
-3 24 -0.100 -0.40 54.2 
-2 24 0.315 1.10 70.8 
-1 24 0.161 0.79 50.0 
1 24 0.142 0.69 66.7 
2 24 0.689 0.31 54.2 
3 24 0.160 0.55 41.7 
4 25 0.261 1.61 68.0 
5 25 0.112 0.69 60.0 
6 25 -0.869 -0.64 40.0 
7 25 0.963 0.46 56.0 
8 25 -0.046 -0.02 56.0 
9 25 0.534* 2.50 76.0 
10 25 0.227 1.09 64.0 
11 25 -0.097 -0.47 56.0 
12 25 -0.341 -1.93 40.0 
13 25 -0.019 -0.10 48.0 
14 24 0.154 0.79 58.3 
15 24 -0.601 -0.92 54.1 
16 24 0.446 1.04 62.5 
17 24 0.022 0.92 62.5 


Sample 
Size 


ND Futures on SIMEX 
Mean 
Return 

22 0.668* 2.18 
21 -0.061 -0.19 
22 0.625* 1.96 
23 -0.356 -1.02 
18 0.445 0.15 
20 0.349 0.84 
18 -0.448 -0.90 
19 0.256 0.76 
21 0.314 1.32 
24 0.355 0.99 
21 0.103 0.45 
24 0.244 1.15 
20 -0.106 -0.51 
23 -0.438 -0.14 
25 0.252 0.97 
25 0.297 1.40 
24 0.358 1.75 
25 0.158 0.07 
25  -0.383* -2.12 
24 0.043 0.17 
22 0.933 0.35 
23 -0.923 -0.72 
21 0.012 0.07 
24 1.472 1.32 


t-Statistic % Positive 


68.2 
57.1 
54.5 
47.8 
61.1 
65.0 
33.3 
57.9 
61.9 
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Figure 9: The Turn of the Month Effect on the ND 
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t-Statistic % Positive 


Osaka 50 Futures on OSE 
Sample Mean 
Size Return 
15 0.270 1.26 
15 -0.082 -0.21 
15 0.702 1.41 
15 -0.548 -1.45 
15 -0.253 -0.82 
15 0.131 0.27 
15 0.054 0.19 
15 -0.062 -0.16 
15 0.121 0.40 
15 0.120 0.24 
15 0.086 0.29 
15 0.049 0.19 
15 -0.305 -1.27 
15 0.117 0.28 
15 -0.037 -0.20 
16 0.361 1.62 
16 0.446 1.86 
16 0.028 0.18 
16 -0.259 -1.33 
16 -0.015 -0.07 
16 -0.274 -1.11 
14 0.007 0.02 
15 -0.068 -0.24 
15 -0.255 -0.69 
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Figure 10: The Turn of the Month Effect on the Osaka 50 
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5. FIRST HALF OF THE MONTH 


The effect that most or all of the gains on the TSE occur on trading days -5 to 
+7 seems to be working according to the script in the spot market. Itis as well 
in the Osaka 50 futures market. The SIMEX futures though seems to at least 
partially smooth out the returns. This first half has higher returns by a 50% 
margin but it does not have all the gains. The following strategy would have 
been profitable during the sample period: buy the SIMEX on the close of -9, 
sell on the open of -5; on the Osaka 50, buy on the close of -8 and sell on the 
open of -5. 


Table 10: Test of the hypothesis that first half of month has all the returns. 
half N Mean % Std Dev Std Error 
1 292 15 1.023 0.0598 
2 265 .008 1.355 0.0942 


The F test with 264 and 291 degrees of freedom indicates that with F=2.25 the 
variances are equal at the 0.1% level. The t test with equal variances then 
indicates that the first half of the month has all the gains with 19.24% 
confidence using a pooled variance with 555 degrees of freedom. 
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Table 11: The First Half of the Month Effect on the ND 


ND Spot on TSE ND Futures on SIMEX 
Sample Mean % Sample Mean 
Day Size Return% t-Statistic Positive Size Return t-Statistic % Positive 
All 557 0.082 1.50 56.5 508 0.098 1.00 56.9 
ist Half 292 0.150* 2.51 57.1 253 0.119 1.29 54.9 
2nd Half 265 0.008 0.08 55.8 255 0.077 0.45 58.8 


Figure 11: The First Half of the Figure 12: The First Half of the Month 
Month Effect on the ND Effect on the Osaka 50 


All 1st Half end Half 


EE] ND Sporon TSE $ ND Futures on 
SIMEX 


tst Half 


E Spot on TSE E Futures on OSE 


Table 12: The First Half of the Month Effect on the Osaka 50 


Spot on TSE Futures on OSE 
Sample Mean % Sample Mean 
Day Size Return t-StatisticPositive Size Return. t-Statistic % Positive 
All 553 -0.000 -0.00 52.1 352 -0.020 -0.30 49.4 
Ist Half 180 0.034 0.37 52.8 180 0.018 0.17 48.9 
2nd Half 173 -0.036 -0.25 51.4 172 0.060 -0.71 50.0 


6. THE MONTHLY EFFECT 


Because of its length of about 30 days one would expect the futures markets to 
closely mirror the spot markets over a monthly period. Research on the spot 
market in Japan appears in Hawawani (1988b), the data of Horimoto (1988) 
reported in Ziemba (1989a) along with additional data reported there. These 
results are summarized in Table 13. 
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Table 13: Monthly Effects on the Tokyo Stock Exchange 


Hawawani (1988) Horimoto Ziemba (1989a) 
373-566 stocks (1988) ND 225 1949-88 
Jan 1955-Dec 1985 All TSE 1st Price Weighted 
Equally Weighted Section, 1965-87 
Equally 
Weighted 
Average Mean Mean Mean 
Monthly Sample Monthly t Monthly Sample Daily t 
Return Over Size __ Retur _ statistic Return Size Return _ statistic 
All months 372 1.60%* 6.40 1.76 11529 0.0482*** 5.56 
all but January 341 1.24%*** 4.75 1.42 na na na 
January 31 5.58%*** 13.17 5.44 885 0.1816*** 5.80 
February 31 1.00 1.85 1.71 905 0.0549 1.87 
March 31 2.90%** 3.77 2.88 986 0.0457 1.50 
April 31 0.51 0.57 0.82 947 0.0623* 2.10 
May 31 0.18 0.20 1.29 956 0.0074 0.27 
June 31 2.51%*** 4.59 2.83 1011 0.0641* 2.39 
July 31 0.65 0.75 1.12 1043 0.0083 0.27 
August 31 0.92 0.85 1.16 1038 0.0790** 2.96 
September 31 0.13 0.17 0.28 931 0.0059 -0.22 
October 31 0.52 0.48 0.17 998 0.0088 0.24 
November 31 1.81 1.76 1.11 910 0.0371 1.27 
December 31 2.48%** 3.02 2.27 919 0.0470 1.37 


January, March, June and December have had the highest mean returns in 
that order in both Hawawani's and Horimoto's samples. Also September and 
October had the lowest returns. Ziemba's longer data set shows similar very 
high returns in January, and the low September and October returns. But 
there are differences in the other months from the Hawawani and Horimoto 
samples. The sample sizes for Horimoto's data are unavailable. 


There is only one January in the Osaka 50 data and two in the SIMEX data but 
the daily data used do give us some results that are significant at the 5% level. 
Tables 14 and 15 and Figures 13 and 14 describe the results. 
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Table 14: The Monthly Effect on the ND Spot and the SIMEX 
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56.6 
69.0 
65.1 
64.6 
60.9 
55.8 
54.2 
57.1 
53.1 
57.4 
44.9 
57.1 


ND Spot on TSE ND Futures on SIMEX 
Sample Mean % Sample Mean 
Month Size Return t-StatisticPositive Size Return t-Statistic % Positive 
all days 532 0.094 1.00 56.8 557 0.082 1.50 
Jan 39 0.380* 1.97 61.5 42 0.396* 2.24 
Feb 40 0.226" 2.06 62.5 43 0.232* 2.26 
Mar 44 0.205 1.79 68.2 48 0.164 1.53 
Apr 41 0.254 1.34 63.4 46 0.272 1.87 
May 37 0.204 0.90 59.5 43 0.154 1.02 
jm 46 -0.003 -0.02 60.9 48 -0.032 -0.29 
Jul 48 -0.002 -0.01 50.0 49 0.064 0.40 
Aug 45 0.128 0.87 46.7 49 0.066 0.65 
Sep 57 -0.079 -0.47 54.4 54 -0.027 -0.20 
Oct 47 -0.242 -0.26 51.0 49 -0.283 -0.64 
Nov 43 0.204 0.85 58.1 42 0.134 0.65 
Dec 45 0.014 0.08 48.9 44 -0.066 -0.52 


Figure 13: The Monthly Effect on the ND Spot and the SIMEX 
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Table 15: The Monthly Effect on the Osaka 50 
Osaka 50 Spot on TSE 


ND Spot on TSE 
Sample Mean 


Month Size Return t-StatisticPositive 


Positive 
all days 353 0.033 
Jan 21 0.465 
Feb 20 0.284** 
Mar 24 0.167 
Apr 23 0.204 
May 21 -0.014 
Jun 40 -0.086 
Jul 49 0.064 
Aug 49 0.066 
Sep 36 0.020 
Oct 25 -0.351 
Nov 21 -0.118 
Dec 22 -0.226 


Osaka 50 Futures on OSE 
Sample Mean 


Month Size Retum t-Statistic 


Positive 

all days 353 -0.020 
Jan 21 0.358 
Feb 20 0.244* 
Mar 24 0.162 
Apr 23 0.170 
May 21 -0.280 
Jun 40 0.076 
Jul 49 -0.081 
Aug 49 0.097 
Sep 36 0.008 
Oct 25 -0.494 
Nov 21 -0.121 


Dec 22 -0.256 


% 
0.45 56.4 
1.42 66.7 
3.21 72.7 
1.20 58.3 
1.87 65.2 
-0.09 52.4 
-0.69 52.5 
0.40 57.1 
0.65 53.1 
0.15 61.1 
-0.43 56.0 
-0.31 47.6 
-1.08 36.4 
% 
-0.28 49.4 
0.95 52.4 
2.68 68.2 
1.04 50.0 
1.14 65.2 
-1.50 42.9 
-0.65 47.5 
-0.47 51.0 
0.86 46.9 
0.05 55.6 
-0.84 33.3 
-0.27 42.9 
-0.92 36.4 


Sample 
Size 


Mean 
Return 


-0.000 
0.410 
0.254* 
0.124 


t-Statistic % 


-0.00 
1.07 
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They describe this pattern as a form of the gambler’s fallacy, the belief that if an event 
just occurred, then the likelihood that it will occur again falls.!5 


6.2. Inefficiencies with Unpopular Numbers 


Fixed payoffs for lotteries are not the only possibility. Pari-mutuel payoffs are used 
by all states for lotto games and by Massachusetts for its numbers game. The pari- 
mutuel method allows a state to guarantee its percentage take by having the payoff 
to winners decreasing in the number of winners. Given that all numbers are equally 
likely,!® no system can be developed that will improve the likelihood of winning any 
of the lotteries that have been described. However, if a numbers or lotto game employs 
pari-mutuel payoffs, then by choosing unpopular numbers, upon winning, one is likely 
to share the given prize with fewer other winners. If some numbers are sufficiently 
unpopular, bets with positive expected return may exist, despite the lottery’s low payout 
rate. Chernoff’s (1980, 1981) study of the Massachusetts number game, where players 
pick a number from 0,000 to 9,999, found that numbers with Os, 9s, and to a lesser extent 
8s, tended to be unpopular. He showed that by concentrating on the unpopular numbers, 
bets with a positive expected return were possible. Clotfelter and Cook (1991) provided 
some evidence of this, too, with three days of 1986 data from Maryland’s three-digit 
numbers game. The most popular three-digit choice was 333 which was 9.93 times 
more common than the average. The seven most popular choices were all triples—333, 
777, 555, 444, 888, 666, and 999—and all were at least five times more popular than the 
average number. The least popular was 092, picked 0.23 times as often as the average 
number, and was followed in unpopularity by 086, 887, 884, and 968, all 0.25 times as 
popular as the average. 

Lotto, with its possibility of prizes of tens of millions of dollars, is one of the most 
popular games, and it has received the most media attention. It involves matching six 
numbers drawn without replacement from 50 or so total possible numbers. If T is the 
total possible numbers and D is the number drawn, then the probability of matching is 
1 in T!/(D\(T — D)!). So, for example, the probability of winning when six numbers 


'SMetzger (1985) considered the gambler’s fallacy at the racetrack, and found support for the hypothesis that 
betting on the favorite should be more attractive after a series of longshots have won than after a series of 
wins by favorites. 

16Johnson and Klotz (1993), on the basis of 200 Lotto America winning combinations, suggest that each 
number may not be equally likely. They find that, roughly, smali numbers are drawn more frequently than large 
numbers. They suggest that it may be a consequence of the mechanical mixing process, that small-numbered 
balls are dropped into the urn first. 
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are drawn from 49 is 1 in 13,983,816. Most games have prizes for matching fewer than 
all the drawn numbers, too, but it is common for half the prize money to go to the grand 
prize. The long odds mean that none of the perhaps millions of bettors might win in a 
given week (the usual period over which lotto is played). In this event, the grand prize 
jackpot is carried over to the next week. ZBGS studied whether unpopular numbers and 
the carryover can allow a profit. Using several methods, they determined that there were 
unpopular numbers, they were virtually the same ones year to year, and they tended to 
be high numbers (non-birthdays, etc.) and those ending in Os, 9s, and 8s. For instance, 
a regression method based on actual payoffs generated the following as the 12 most 
unpopular numbers: 32, 29, 10, 30, 40, 39, 48, 12, 42, 41, 38, and 18. They were 1,530% 
less popular than average. The most popular number, 7, was selected nearly 50% more 
often than the average number. Using a maximum entropy distribution approach, Stern 
and Cover (1989) identified 20, 30, 38, 39, 40, 41, 42, 46, 48, and 49 as the 10 most 
unpopular numbers while 3, 7, 9, 11, 25, and 27 were the six most popular.!” 

ZBGS showed that expected returns of $1.50 without carryover and up to $2.25 with 
carryover per dollar bet are possible. Does this imply that lotto games can be profitable, 
though? To see that it may not, consider a hypothetical game where you pay $1, choose a 
number between 1 and 1,000,000, and if your number matches the one that is randomly 
selected, then you win $2,000,000. In spite of your edge, you are likely to go bankrupt 
before winning the jackpot. A reduced wager will increase the likelihood that you will 
eventually hit the correspondingly reduced jackpot before you go bankrupt, but your 
expected wealth will suffer. MacLean et al. (1992) analyzed this problem using a model 
contrasting the growth of wealth and the security of wealth and found that lotteries are 
an impractical way for modestly endowed investors to enhance their long-term wealth. 
For instance, by wagering an optimally small amount each round, one’s initial stake 
can be increased tenfold before losing half the stake with a probability close to one. 
However, millions of years of wagering are required, on average. For example, consider 
the hypothesized data in Table 7 and the results in Figure 7. With a more attractive set 
of prizes, the probability is arbitrarily close to one for sufficiently small wagers (see 
MacLean et al., 1992). 

Rather than make optimally small wagers in the face of small probability gambles, 
growth may be improved by increasing the probability of success. For lotteries, this 
can be accomplished by buying more than one combination of numbers. It may even be 
possible in the face of a substantial carryover to profitably purchase most, or perhaps all, 
of the combinations. There have been times when this would have been profitable. In 
practice, though, the transaction costs are enormous because tickets must be purchased 
one at a time. Furthermore, there is the worry that others might also be covering all the 
numbers, to your joint detriment.'® 


"7 See also Joe (1987). Clotfelter and Cook (1991) provided another example of popular numbers from Mary- 
land’s lotto, which has 40 total possible numbers. On the particular day they analyzed, players picked the 
1-2-3-4-5-6 combination over 2,000 times more frequently than the average pick. Had this been the winning 
combination (at a chance of one in 3,838,380), winners would have collected only $193.50! 

18A related opportunity arises with horse racing pick-sixes (pick the winners of six consecutive races) if there 
are substantial carryovers. Covering all pick-six possibilities is easily accomplished at the track and may be 
profitable if few others behave likewise. 
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TABLE 7 Lotto 6/49 Data 


Prizes Probability Value Contribution 
Jackpot 1/13,983,816 $6,000,000 42.9 
Bonus 1/2,330,636 $800,000 34.3 

5/6 1/55,492 $1,000,000 9.0 

4/6 1/1,032 $5,000 14.5 

3/6 1/57 $150 17.6 
Edge 18.1% 
Kelly bet 0.0000001 1 
Number of tickets with $10,000,000 bankroll 11 


Source: MacLean et al. (1992). 
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FIGURE 7 Lotto 6/49—Probability of multiplying before losing half of one’s fortune vs. bet size. 
Source: MacLean et al. (1992). 


Lotto typically involves drawing six numbers. Different states have different total 
possible numbers, though, resulting in very different probabilities of winning. In 1990, 
the extremes were one chance in 974,000 (36 numbers and two picks per ticket) in 
Delaware and one chance in 22,957,480 (53 total numbers and one pick per ticket) in 
California (Cook and Clotfelter, 1993, p. 635). Cook and Clotfelter (1993) explain this 
as a trade-off that states must make between the size of the jackpot and a player’s esti- 
mate of the likelihood that he or she will win. The former is easily learned through 
advertisements and the media. The latter, according to Cook and Clotfelter, is gener- 
ally not well understood but tends to be based on the frequency with which someone 
wins (p. 634). Thus, Delaware could increase its total possible numbers to 53 as in 
California but, given its population, on average there would be many weeks between 
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winners. This would lower the public’s view of the likelihood of winning and the 
attractiveness of purchasing a ticket. On the other hand, given California’s population, 
even with 53 total possible numbers there will usually be a winner each week. This 
nonrational means of probability assessment causes a scale effect whereby per capita 
expenditure increases with the population base of the lottery. Smaller states cannot 
exploit this scale effect themselves but can through forming consortia with other states, 
as happens with the Tri-State lottery (involving Maine, New Hampshire, and Vermont) 
and the states constituting Lotto America. 
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Abstract 


This chapter examines whether the favorite-longshot bias that has been found in 
gambling markets (particularly horse racing) applies to options markets. We investigate 
this for all options on the S&P 500 futures and the FTSE 100 futures for the 17+ years 
from March 1985 to September 2002. Calls on the S&P 500 with both three months and 
one month to expiration display a relationship between probabilities and mean returns 
that are very similar to the favorite bias in horse racing markets. There are slight profits 
from deep in-the-money and at-the-money calls on the S&P 500 futures and increasingly 
greater losses as the call options are out-of-the-money. For three-month and one-month 
calls on the FTSE 100 futures, the favorite bias is not found, but a significant longshot 
bias has existed for the deepest out-of-the-money options. For the put options on both 
markets, and for both three-month and one-month horizons, investors overpay for all put 
options as an expected cost of insurance to protect against downside risk. The patterns 
of mean returns is analogous to the favorite-longshot bias in racing markets. 


JEL Classifications: C15, G13 


Keywords: Longshot bias, gambling, option prices, implied volatilities 


1. INTRODUCTION 


Griffith (1949), McGlothin (1956), Snyder (1978), Ali (1979), and others have doc- 
umented a favorite-longshot bias in racetrack betting.’ High probability-low payoff 
gambles have high expected value and low probability—high payoff gambles have low 
expected value. For example, a 1/10 horse having more than a 90% chance of winning 
has an expected value of about $1.03 per $1 bet, whereas a 100/1 horse has an expected 
value of about 14 ¢. The favorite-longshot bias exists in other gambling markets such as 
sports betting; see Hausch et al. (1994) for a survey of results. 

In Ziemba and Hausch (1986), the expected return per dollar bet versus the odds 
levels are studied for over 300,000 horse races. The North American public underbets 
favorites and overbets longshots. This bias has appeared for many years across all sizes 
of racetrack betting pools. The effect of these biases is that for a given fixed amount 
of money bet, the expected return varies with the odds level; see Figure 1. For bets on 
extreme favorites, there is a positive expected return. For all other bets, the expected 
return is negative. The favorite-longshot bias is monotonic across odds or, equivalently, 
the probability of winning and the drop in expected value is especially large for the 


'While the horse racing favorite-longshot bias is quite stable and pervasive, there exist exceptions in Asian 
racetrack markets (Busche and Hall, 1988; and Busche, 1994). The favorite-longshot bias literature is sur- 
veyed in Hausch et al. (1994, 2008) where many papers are reprinted including the early studies of Griffith 
(1949) and McGtothin (1956). See also the survey of Sauer (1998). Recent papers consistent with the usual 
bias are Hurley and McDouough (1996), Sobel and Raines (2003), and Ottaviani and Sørensen (2003) plus 
the chapters in this volume. 
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FIGURE 1 The effective track payback less breakage for various odds levels in California and New York 
for 300,000 plus races over various years and tracks. 
Source: Ziemba and Hausch (1986). 


lower probability horses. The effect of differing track take/transaction costs is seen in 
the California versus New York graphs. 

Thaler and Ziemba (1988) suggest a number of possible reasons for this bias. 
These include bettors’ overestimation of the chances that longshot bets will win as 
in Kahneman and Tversky (1979). Tversky and Kahneman (1983) argue that bettors 
might overweight small probabilities of winning when the potential payout is large (in 
calculating their utility). Bettors may derive utility simply from the hope associated 
with holding a ticket on a longshot, as it is more fun to pick a longshot to win over a 
favorite and this has more bragging rights. Transaction costs also play a role. Finally, 
they suggest that some bettors may choose horses for irrational reasons, such as the 
name of the horse. Other explanations are that the bias results from the complexity 
of the wagers and the information available to bettors and not from risk-preferences; 
see Sobel and Raines (2003). Ottaviani and Sgrenson (2003), Hurley and McDonough 
(1996), Quandt (1996), and Shin (1991, 1992) provide theoretical models that attempt 
to explain the bias. See also the chapters in this volume, especially Ottaviani and 
Sgrensen (2008). The reasons for the effect varying over time are a combination of 
several factors. These include: (1) the utility bettors gain from betting on longshots and 
the associated preferences over the skewness of returns; (2) the systematic tendency 
for bettors to overestimate the chances of low probability outcomes, and underestimate 
high probability outcomes, and (3) the type and information aspects of informed and 
noise bettors in the race involved. Sobel and Raines (2003) show that the bias is steeper 
for lower quality races compared to higher quality races even on the same day at the 
same track. Consistent with this, Ziemba and Hausch (1987) show that the bias for the 
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Kentucky Derby is much less than is typical. The 1986 graph in Figure | is more flat in 
2007 until the tail dropoff for high odds horses. On this, see, in this volume, the chapters 
by Snowberg and Wolfers (2008) and Ziemba (2008). 

Puts and calls on stock index futures represent leveraged short or long positions on 
the index and their behavior might have similar features to racetrack bets. Demand for 
options comes from both hedging and speculation. The primary use of put options is for 
hedging. 

For the call options, the most obvious hedging demand is to sell them against existing 
holdings of equity. This covered call strategy tends to depress the price of (especially 
out-of-the-money) call options. If this were the sole mechanism for dealing in call 
options, this should result in an increase in the expected return for purchasers of out-of- 
the-money call options. Coval and Shumway (2001) considered the expected and actual 
returns for options on the S&P 500. They find that call options (in a fairly narrow range 
around the current underlying S&P 500 price) have a higher expected return relative to 
the underlying S&P 500 index market. While this result is consistent with a leverage 
effect (the beta of options being much larger than the beta of the S&P 500), the return 
remains less than what it should be if leverage were the sole factor. Coval and Shumway 
(2001) do not consider the investment in deep out-of-the-money call options on the S&P 
500 as we do here.” 

We find significant expected losses for such deep out of the money call options. This 
could be due to speculative activity similar to that for longshot horse race bets. However, 
Bollen and Whaley (2003) showed that buyer-initiated trading in index puts dominates 
the market. Because there are few natural counter-parties to these trades (apart from 
hedge funds), the implied volatilities of these options rise and the implied volatilities 
of the corresponding call options rise due to put-call parity. However, they show that 
the primary choice of buyer-initiated index put trading occurs for the nearest out-of- 
the-money put options. They also stated, “since portfolio insurers generally buy OTM 
puts rather than ITM puts,” this implies that relatively speaking the demand for in-the- 
money puts is less and given that they argue that option mispricing is due to supply and 
demand imbalances at different strike prices, then in-the-money puts would be relatively 
less expensive. By put-call parity, this implies that the costs of the out-of-the-money call 
options would be relatively less expensive and offer a higher return. Nevertheless, our 
results indicate that deep out-of-the-money call options are overpriced. 

Rubinstein (1994) pointed out that the implied volatilities for options on the S&P 
500 changed after the 1987 stock market crash with the prices of out-of-the-money 
put options rising and the prices of out-of-the-money call options falling (relative to 
the price of the at-the-money option). This implied volatility skew (or smile) effect 
has been an active area of research. Buraschi and Jackwerth (2001, p. 523) con- 
clude, “returns on out-from-the-money options are driven by different economic factors 


*During January 1990 to October 1995, which was the period of the Coval and Shumway (2001) analysis, 
the average underlying S&P 500 futures price was approximately 430. They examined puts with strikes 15 
points below and calls with strikes 10 points above. This implies an average percentage difference of strike 
prices that were 3.5% below the current price for puts and 2.3% higher for calls. In this study, we examined 
all traded options with ranges +43.3% of the current S&P 500 price. 
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than those relevant for at-the-money options.” We consider the returns for such deep 
away-from-the-money put options (as opposed to the near-the-money options consid- 
ered by Coval and Shumway, 2001). 

With market imperfections (such as transaction costs or other frictions that disallow 
riskless hedges to be constructed in continuous time) or incomplete markets, option 
prices are no longer uniquely determined by arbitrage, and may be determined (within 
limits) by supply and demand. Dumas et al. (1998) suggest that the behavior of market 
participants may be the reason for the existence of smiles. They state: “with institu- 
tional buying pressures for out-of-the-money puts and no naturally offsetting selling 
pressure, index put prices rise to a level where market makers are eventually willing 
to step in and accept the bet that the index level will not fall below the exercise price 
before the option’s expiration (i.e., they sell naked puts) . . . option series clienteles may 
induce patterns in implied volatilities, with these patterns implying little in terms of the 
distributional properties of the underlying index” (p. 21). 

Figlewski (1989) suggests that volatility smiles exist because of the demands of 
option users. He suggests that the higher prices (and resulting higher implied volatil- 
ities) associated with out-of-the-money options exist because people simply like the 
combination of a large potential payoff and limited risk. He likens out-of-the-money 
options to lottery tickets with prices such that they embody an expected loss. Never- 
theless, this does not dissuade some from purchasing them.? This would suggest that 
investors might be acting irrationally. Poteshman and Serbin (2002) show that this is 
the case for the exercise of exchange-traded stock options. They conclude that the early 
exercise of American calls on stocks during the period of 1996-1999 was in many 
instances “clearly irrational without invoking any model or market equilibrium.” If 
investors act irrationally in this regard, it is also possible they also act irrationally when 
assessing the value of the option and could display similar irrational behavior to other 
speculative endeavors such as gambling. 

We examine the returns from investing in call and put options on stock index futures 
markets and assess whether the mean returns are biased for high leverage situations, 
as they are in various betting markets. To test the hypothesis that options display such 
biases requires a sufficient number of independent observations in actively traded mar- 
kets and a broad enough range of strike prices where such low probability options are 
quoted. We use stock index futures options data, as these markets have existed for a 
sufficiently long period of time to yield enough independent exercise cycles and the 
range of offered strike prices allow the entire probability spectrum to be spanned. These 
instruments may be dominated by institutional investors buying portfolio insurance (as 
suggested by Bollen and Whaley, 2003). Given that such speculative behavior may be 
more likely in option markets with more retail activity, it could be helpful to exam- 
ine individual stock option markets in parallel. However, either such individual stock 
option markets may not have been as actively (and consistently) traded as the stock 


3The purchase of overpriced out-of-the-money puts may be justified by the desire of investors to provide a 
smoother risk profile in volatile markets. 
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index options, yield sufficient independent expiration cycles,’ and often do not offer 
a sufficiently wide range of strike prices to examine extreme probability events, we 
exclude their analysis here. In any case, Bakshi et al. (2003) suggest that the exclusion 
of individual stock options in our study will not seriously affect our general conclu- 
sions. They consider the skew effect for individual equity options written on the S&P 
100 (OEX) cash index and its 30 largest equity components. Using 1991-1995 data, 
they show that the risk-neutral index skews exist and are a consequence of risk aversion 
and fat tailed distributions. Given that they report that individual stock skews are flatter 
than the index skews, this suggests that the option pricing bias is not greater in mar- 
kets with more retail customers (i.e., individual stock options) but is in fact less. These 
equity markets have less systematic risk premium than the index markets and that is 
reflected in the less steep skew. 

As equity index option markets have a wider range of available strike prices and 
trade on a monthly expiration cycle, yielding more independent trials than for stock 
options, we restrict our analysis solely to these markets. We examine two markets that 
have slightly different levels of retail trading activity. Our first is the S&P 500 futures 
options market. According to the Marketing Department of the Chicago Mercantile 
Exchange (and from Large Position reports from the Commodity Futures Trading Com- 
mission) virtually all trading activity for options on the S&P 500 futures comes from 
institutional traders. For options on the Financial Times Stock Exchange (FTSE) 100 
futures traded at the London International Financial Futures Exchange (LIFFE) there 
is more retail involvement. Press releases from LIFFE report that retail involvement in 
these options comprise up to 10% of the total volume (similar to that of the individual 
stock options traded on the LIFFE). This market will provide some insights into the 
impacts of non-professional trading on the favorite-longshot bias. 

Section 2 presents data sources and the methodology for the transformation of option 
prices into odds, so that the results can be compared to the horse racing literature. 
Section 3 presents results for the S&P 500 and FTSE 100 options markets. Section 4 
concludes. 


2. METHODOLOGY 


To investigate whether a favorite-longshot bias exists in option markets requires a trans- 
formation of option prices into odds. In the Black Scholes (1973) equation, N (d2) is 
the forward price of a digital option that pays $1 if F > X. It is the (risk neutral) odds 
at which investors can bet on this event. For a put option, the digital that pays $1 if 
X < Fis N(—dz). As with the racing studies, one must collect a large sample of inde- 
pendent events, determine the odds of certain events occuring, invest a fixed amount in 
each bet (say $1), and examine the a posteriori payoff of that bet. A pool of bets with 
the same odds must be aggregated and the mean payoff returns calculated. Our data 


‘Individual Stock Options are only offered on a quarterly cycle. Given that Stock Index options have had 
monthly expirations since 1987, there are more independent observations to test the hypotheses. 
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is the publicly available settlement prices for the futures contracts and all call and put 
options on the S&P 500 and FTSE 100 index markets on those dates when the options 
had either exactly one month or three months to expiration. The period of analysis 
was the 17.5 years from March 1985 to September 2002 and yielded 69 indepen- 
dent quarterly observations for the S&P 500 and FTSE 100 futures. S&P 500 futures 
began trading on April 21, 1982 and puts and calls on the S&P 500 began trading on 
January 28, 1983. The early years had little volume and few strike prices. Hence, 
our dataset covers the vast bulk of options trading in the S&P 500. For the monthly 
observations (serial options), there were 187 independent observations for the S&P 
500 futures and 124 observations for the FTSE 100 index options markets. The data 
were obtained from the Chicago Mercantile Exchange for the S&P 500 futures and 
options. These option contracts are American style options on futures. The data for the 
FTSE 100 futures and options were obtained from the LIFFE for the European style 
options on futures from 1992 to 2002 and from Gordon Gemmill for the American style 
options on futures prior to 1992. The interest rate inputs were obtained from the British 
Bankers Association (U.S. Dollar or British Pound LIBOR). 

Monthly and quarterly data were used instead of daily data to ensure independence of 
the observations and final outcomes. We identified all expiration dates for all available 
options over the sample period. On that day, we recorded the settlement levels of the 
futures contract (the nearest to the expiration of the futures contract and possibly the 
cash index if that date was a simultaneous expiration of the futures and options contract), 
and all available option prices on this nearby futures contract that had either one month 
or three months to expiration. 

Given that settlement prices were used, it was not necessary to conduct the standard 
filtering procedures such as butterfly arbitrages; see Jackwerth and Rubinstein (1996). 
However, we did remove all options with prices below 0.05 (as for a trade to take place 
the offer price must be at least 0.05). With 17 years of quarterly data, we had 69 quar- 
terly observations in our analysis with an average of 39.1 available strike prices per 
observation for the options on the S&P 500 and 30.8 strikes for options on the FTSE 
100. For the monthly expirations, the average number of strike prices available for the 
S&P 500 options was 39.0 and 28.6 for the FTSE 100. 

The first step is to calculate a measure of the odds of options finishing in the 
money (analogous to the odds in horse racing). Since the options are American, the 
Barone-Adesi and Whaley (1987) approximation has been used to recover the implied 
volatilities, which have then been substituted into the Black (1976) formula to calculate 
the pseudo-European option probabilities [N(d2) and N(—d>)]. For the European style 


>The pit committee of the CME determines the settlement prices rather than by market transactions and this 
could impact our results (especially for OTM options). However, the actual price at the end of the trading 
day could be a bid, mid, or offer price. Given that our analysis considers the payoffs from purchasing options, 
if the actual price that could be dealt at was the bid or mid price (rather than the offer price we implicitly 
assume), the payoffs of the options would be reduced accordingly. Therefore, our estimates of the wealth 
relatives for buying OTM options are more likely to be overly optimistic. 

®To examine the impact of the 1987 crash, we also analyzed the post-crash period of our dataset. The results 
were not materially different (apart from small reductions in the mean wealth relatives of out-of-the-money 
put purchases). 
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options on the FTSE 100, the Black (1976) implied volatilities were directly used.” In 
all markets, the implied volatilities for each option were used to calculate the odds. To 
make a more consistent comparison with horse race betting, the premium for the options 
were expressed in forward value terms. Using Black’s (1976) formula 


Cry = FN(d,) — XN(d2) (la) 
Py, = XN(—d,) — FN(—a2). (1b) 
where 
qe In($) + 40° (T-t) 
ovr 
and 


d,=d, -ov(T =t). 


As we only observe the current option prices C,, and P,,, we transform these to the 

results in Equations (1a) and (1b) by multiplying the observed prices by e”7-® (where 

r is the LIBOR interpolated between adjacent standard maturities as reported by the 

British Bankers Assocation on the observation date, f, and T is the expiration date). 
The terminal payoffs of the options are 


Cr = MAX (Fr — X,0) and Pr = MAX (X — Fr,0), (2) 


respectively. We calculate the wealth relatives of the ratios of these to the initial option 
forward values: in the absence of risk premiums these would be expected to average 
to one. 

An important issue in averaging them is how the wealth relative on each option 
should be weighted. In our data sample, the number of strikes available increases with 
time. We would therefore lose efficiency if we weighted all options equally, as this 
would correspond to investing increasing amounts over time, where, for a given day the 
returns on options at different strikes are not independent. Our first principle is therefore 
to weight each monthly or quarterly period equally, by investing a fixed amount of 
money (e.g., $1) at each date. 

To achieve the same investment amount for the alternative option contracts, the 
number of options purchased equals 


Qc = $1/Cy, and Qp = $1/ Pry, (3) 


respectively, for all calls and puts. Equation (3) suggests that for higher priced options 
(e.g., in-the-money), the quantity purchased will be small and for lower priced options 


TTo be strictly comparable with horse racing odds, we should calculate the cost of a digital option under the 
distribution implied from option prices. However, it is not clear which of the variety of parametric and non- 
parametric approaches for the determination of implied distributions would be most appropriate. In any case, 
this would introduce another possible source of error. 
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(e.g., out-of-the-money), the number of options purchased will be large. We interprete 
the in-the-money options as the favorites and the out-of-the-money options as the 
longshots. 

Unlike horse racing, the (risk neutral) probabilities of payoff in the options markets 
are not expressed as odds but in a continuous probability range from 0% to 100% (and at 
random points). In horse racing, while the bets are expressed as odds, such bets actually 
represent a continous probability range for all bets between discrete categories (and are 
rounded down). As examples, 9/5 bets cover all ranges from 1.80 to 1.99 to 1 and 5/2 
bets covers all bets from 2.00 to 2.49 to 1. 

To determine expected wealth relative at fixed “odds” levels [N(d2) or N(—d2)], 
we use interpolation to estimate what strike and option price would apply. Within the 
range of “odds” that exist on a given day, we linearly interpolate the implied volatility 
between adjacent strikes. With each wealth relative estimated thus, we form a sim- 
ple average of wealth relatives from non-overlapping periods, and can therefore easily 
perform significance tests.” 

Standard significance tests (such as a one-tailed t-test) may be inadequate when the 
sample distribution is not normal. The holding period return distributions of options 
tend to be quite positively skewed, and particularly so for out-of-the-money options, and 
when a risk premium on the underlying increases (for calls) or reduces (for puts) the 
(objective) probability of exercise. Care is therefore needed in testing the significance 
of the mean wealth relative to any given null hypothesis. To address this, we conducted 
Monte Carlo simulations to obtain the distribution of the realized mean wealth relatives 
for samples of suitable sizes (60 for quarterly and 160 for monthly horizons).'° These 
simulations were done under Black Scholes assumptions, with and without a risk pre- 
mium, and for one-month and three-month times to expiration. The confidence intervals 
obtained in this way were noticably different from the t-test intervals that would have 
been applicable for a normal (or nearly normal) distribution. !! 


8We only interpolate and do not extrapolate beyond the range of traded strikes. 

°We compute mean expected payoffs for various odds (probability of finishing in the money) bets to be 
comparable to the racetrack literature. One could beta risk adjust these bets to possibly separate out risk from 
behavorial biases. Coval and Shumway (2001) show how to do this. They find that there are then negative 
expected returns from buying puts and calls. This is consistent with our story that long run expected profits 
accrue to option sellers rather than option buyers. However, the Shumway and Coval results show that if 
you risk adjust the expected return from buying in-the-money 3 month calls is most likely negative and not 
positive as shown in Figure 3. 

10The Monte Carlo simulation entailed simulating 10,000 times the average wealth relative for 60 (quarterly) 
and 160 (monthly) payoffs for call and put options with N(d2) from 0.05 to 0.95 in 0.05 increments. The 
standard error of the wealth relative was determined and the appropriate confidence levels were determined. 
For the inclusion of the risk premium, the negative continuous dividend adjustment to the Merton (1973) 
model proposed in the following section was used to determine the ratio of the expected wealth relative 
compared to the Black Scholes (1973) option price. 

1I Nevertheless, because the confidence intervals most affected are for the right hand tails of out-of-the-money 
options (which tended not to be observed empirically) the use of the simulated intervals makes little difference 
to our results. 
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3. RESULTS 


The first step is to examine what the payoffs of call and put options would be under the 
Black Scholes (1973) model. Although the presence of a risk premium on the equity 
index does not affect the option valuation, it will affect the pattern of realized wealth 
relatives. When risk premiums exist (e.g., in equity markets, see Constantinides, 2002), 
the expected return for the investment in options will differ from the $1 investment. 
Similar in spirit to Coval and Shumway (2001), we examined the expected theoretical 
returns for call and put options using the Black Scholes (1973) formula with no risk 
premiums and risk premiums of 2%, 4%, and 6%. This is done by using —2%, —4%, and 
—6%, respectively, as the continuous dividend rate, using the Merton (1973) dividend 
adjustment, in the Black Scholes formula. The ratios of the option prices are determined 
and plotted as a function of money. This can be seen in Figure 2 for call and put options. 
The calls lie above the $1 investment and the puts lie below the $1 investment. 
Consistent with the theoretical results of Coval and Shumway (2001), who show that 
in a very general setting, call options written on securities with expected returns above 
the risk-free rate should earn expected returns that exceed those of the underlying secu- 
rity and put options should earn expected returns below that of the underlying security. 
They also show that under very general conditions, these divergent expected returns 
would be increasing with the strike price (degree of out-of-the-moneyness). With this 
guidance as to how we would expect option returns to behave as a function of the 
Black Scholes (1973) model with risk premiums, we can now assess the returns actu- 
ally observed for options on the S&P 500 and FTSE 100 futures. The results appear in 
Tables 1 and 2 for three-month options. The call and puts options appear on the left-hand 
and right-hand sides, respectively. For both, the first column is the odds of finishing in 
the money as measured by N(d2) or N(—d2). The next column indicates the number 


Expected Wealth Relatives: Calls and Puts 


ree 2% Risk Premium 
— 4% Risk Premium 
—*— 6% Risk Premium 


Np) 
N(~ ab) 
10 09 08 07 06 05 04 03 02 0.1 0.0 


FIGURE 2 Expected wealth relatives for call and put options with alternative risk premium levels. 
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TABLE 1 Mean Return per $1 Bet vs. Odds Levels: Three-Month Options on S&P 500 Futures, 1985-2002 


Call options on the S&P 500 futures 


Put options on the S&P 500 futures 


Average SDof 

Odds (%)  #Obs payoff payoff t-test vs. $1 Odds (%) 
.95-1.00 47 1.0010 0.3204 0.02 -95-1.00 
90-95 60 1.0561 0.4605 0.95 -90-.95 
.85-.90 66 1.1231 0.5704 1.76°* .85-.90 
.80-.85 67 1.1407 0.6990 1.66** .80-.85 
.75—.80 63 1.0938 0.5953 1.25 .75—.80 
10-75 64 1.1366 0.7732 1.41* -70-.75 
.65-.70 62 1.1461 0.8648 1.33* .65—.70 
.60-.65 59 1.1311 0.9972 1.01 60-.65 
55-60 58 1.1727 1.1154 1.18 .55-—.60 
.50-.55 54 0.9890 1.0410 -0.08 .50-.55 
.45-.50 56 1.1365 1.3925 0.73 .45-.50 
40-45 58 1.2063 1.6012 0.98 .40-.45 
.35—.40 51 0.9770 1.7015 -0.10 .35-.40 
30-35 54 0.9559 1.6041 -0.20 .30-.35 
.25—.30 59 1.2923 2.7539 0.81 .25-.30 
.20~.25 53 1.1261 2.5378 0.36 .20-.25 
15-20 55 0.8651 2.0742 -0.48 .15-.20 
10-15 56 1,2262 3.6982 0.46 10-15 
.05-.10 53 1.5085 5.3370 0.69 .05-.10 
.00-.05 39 0.0123 0.1345 -44,.89"*** .00-.05 
All All 
options 69 1.1935 2.4124 0.67 options 


#Obs 


37 
44 
50 
54 
53 
51 
53 
54 
50 
56 
51 
56 
56 
62 
64 
65 
64 
66 
66 
57 


69 


Average 
payoff 
0.8998 
0.8662 
0.8426 
0.7937 
0.8137 
0.7879 
0.7702 
0.6215 
0.8225 
0.5807 
0.7344 
0.6785 
0.4744 
0.6257 
0.6316 
0.6426 
0.6696 
0.6602 
0.6432 
0.7525 


0.6212 


SD of 
payoff 
0.4493 
0.5872 
0.7265 
0.8120 
0.8950 
0.9979 
0.9648 
1.0258 
1.2458 
1.1377 
1.4487 
1.5367 
1.2383 
1.6791 
1.8231 
1.9854 
2.2441 
2.6359 
3.4256 
5.6025 


2.5247 


t-test vs. $1 


-1.35* 
-1.50° 
-1,53* 
-1.86** 
-1.51* 
-1.51* 
-1.73* 
-2.70**** 
-1.01 
~2.76"*** 
-1.31* 
-1.57* 
-3,19**** 
-1.76** 
-1.62* 
~1.45" 
-1.18 
-1.05 
-0.85 
0.33 


-1.25 


of observations we have for that particular 5% band (i.e., days for which $1 could be 
invested). The average payoff for a $1 investment in that particular option band appears 
next and is followed by the standard deviation of the option payoffs within the band. 
The final column is a modified one tailed t-test of the hypothesis that the mean return is 
equal to the initial investment of $1 using 


where 


t= (x: = $1) / (sii) 


n 
X=} Xy nand X;; 
j=l 


(4) 


(5) 
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Xij is the wealth relative of the jth option in the ith continuous “odds” range, and À is 
the equity risk premium. Critical levels for the t-test were determined using a Monte 
Carlo simulation. When the hypothesis is rejected at a 90% level or above, the t-statistic 
appears in bold print, “*”, “**,““***", or “**** on the t-statistic indicates that the 
level of significance is greater than the 90%, 95%, 97.5%, or 99% level, respectively. ! 

Figures 3 and 4 provide a graphical view of the mean returns, related to in-the- 
moneyness, in these markets. These are not plots of the data in Tables 1 and 2, but were 
calculated by our continuous interpolation method. This was done solely to allow 


TABLE 2 Mean Return per $1 Bet vs. Odds Levels: Three-Month Options on FTSE Futures, 1985-2002 


Call options on the FTSE futures Put options on the FTSE futures 
Average SDof t-test Average SD of t-test 
Odds (%)  #Obs payoff payoff vs. $1 Odds (%)  #Obs payoff payoff vs. $1 
.95-1.00 32 1.0294 0.3215 0.52 .95-1.00 29 1.0019 0.5058 0.02 
.90-.95 38 1.0485 0.4830 0.62 .90-.95 38 0.8995 0.6101 1.02 
.85-.90 41 1.1025 0.5901 1.11 .85-.90 36 0.8564 0.7274 = 1.19 
80-.85 43 1.1033 0.7033 0.97 -80-.85 37 0.9628 0.8862  —0.25 
.75-.80 44 0.9531 0.6601 —0.47 .75-.80 40 0.9709 0.9221  —0.20 
.70-.75 49 0.9473 0.7491  —0.49 .70-.75 37 0.9201 1.0829 -0.45 
.65-.70 47 1.1151 .0764 0.73 .65-.70 40 1.0430 1.1861 0.23 
.60-.65 49 0.8999 0.7903 -0.89 .60-.65 43 0.8264 1.1006 —1.03 
.55-.60 44 1.1142 .1296 0.67 .55-.60 38 0.9276 1.3428  —0.33 
.50-.55 45 0.9505 1.2324 —-0.27 .50-.55 39 0.8525 1.3050 -0.71 
.45-.50 44 1.0148 .1783 0.08 .45-.50 48 0.8615 1.5273  —0.63 
.40-.45 41 0.8594 1.1062 ~0.81 .40-.45 43 0.8764 1.7370  —0.47 
.35-.40 43 1.1381 8821 0.48 35-40 48 0.7311 1.4967 -1.25 
30-.35 43 0.6177 1.1931  —2,10"** 30-35 44 1.0169 2.2145 0.05 
.25-.30 47 1.0396 2.1356 0.13 .25-.30 53 0.7216 2.2611 —0.90 
.20-.25 38 0.8813 1.9081  —0.38 .20-.25 49 0.6252 1.9079  —1.37 
.15-.20 0 0.4773 1.3779 -2.40*** -15-.20 48 1.0081 3.3628 0.02 
10-15 42 0.9025 2.6841 —0.24 10-15 46 0.4131 1.9507 —2.04""" 
05-10 37 0.1421 0.7891 -—6.60"*** 05-.10 44 0.3600 2.2526 -1.88** 
.00-.05 35 0.1877 1.1102 —4,.32"""" .00-.05 38 0.0893 1.0420 —5.390"" 
All All 
options 70 0.9983 1.4668 —0.01 options 70 0.6016 1.6203 —2.05*** 


12The confidence intervals used were based on a risk premium of 1.75% per quarter, which was the average 
realized risk premium for the two stock index markets over the period of our analysis. Thus, à in Equation 
(4) is 1.0175 for quarterly options and 1.00583 for monthly options. 
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3-Month Stock Index Futures Call Options Wealth Relatives 


-2.3 
L20 

18 
L 1.5 
1.3 
1.0 
Log 
Los 
Log 


- 0.0 
1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0.0 


FIGURE3 Mean return per dollar bet vs. odds levels: three-month stock index calls, 1985-2002. 
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FIGURE 4 Average return per dollar bet vs. odds levels: five-month stock index puts, 1985-2002. 
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continuous, smooth curves to be drawn (and does not alter our interpretation of the 
results for the ranges in Tables 1 and 2). 


3.1. Results for Quarterly Options on Stock Index Futures 


For the call options on the S&P 500 futures, we find a similar favorite-longshot bias as 
in horse racing. The deep in-the-money call options in the probability ranges of 65% to 
90% have a mean return of $1.058. For the remaining ranges from 5% to 65%, we can- 
not reject the hypothesis that the return is significantly different from the $1 investment. 
For the deepest out-of-the-money calls (0—5%) the mean returns are only 1.23 cents 
per dollar invested. We reject the hypothesis of an expected return of $1 for the lowest 
5% at a 99% level or above. This result supports the hypothesis of Figlewski (1989) 
that out-of-the-money call options are seen by investors is lottery tickets and investors 
overpay for deep out-of-the-money call options on the S&P 500 futures. Thus, the liter- 
ature on “excessive optimism” in the assessment of risky situations may apply here; see 
Kahneman and Tversky (1979) and Tversky and Kahneman (1983). 

For the call options on the FTSE 100 futures, for call options in the range between 
80% to 100% probabilities, there is no significant difference between the expected pay- 
off and the initial $1 investment. Likewise, for most of the range from 35% to 80%, we 
cannot reject the hypothesis that the return is significantly different from the $1 invest- 
ment. However, for most of the out-of-the-money calls with probabilities less than 35%, 
we reject the hypothesis of an expected return of $1 at above a 99% confidence level. 

The put options on both the FTSE 100 and S&P 500 futures (essentially) all have 
negative mean returns. Moreover, the mean payoff is decreasing as the probabilities 
decrease, analogous to the horse racing favorite-longshot bias. This is also consistent 
with the contentions of Rubinstein and Jackwerth (1996), Dumas et al. (1998), and 
Bollen and Whaley (2002) that investors view put options as insurance policies and are 
willing to accept an expected loss to protect their holdings of equity against downside 
risk losses. To provide a clearer comparison between our results and those of Ziemba 
and Hausch (1986), the figures use similar axes: probabilities equal the reciprocal of the 
odds plus one. This can be seen for sets of stock index options in Figures 3 and 4. 

This allows direct comparison to Figure 2, that presents the theoretical relationship 
between an option’s expected returns and risk premiums. If risk premium was causing 
call options returns to return more than the $1 investment, we would expect Figure 3 
to resemble the upper portion of Figure 2. When the returns are expressed as wealth 
relatives, out-of-the-money options offer a lower rate of return—exactly the opposite 
of what we expect. Therefore we conclude that the mechanism at work is not the risk 
premium argument of Coval and Shumay (2001) but a favorite-longshot bias. 

In Figure 3, in-the-money call options yield more than the $1 invested in each option. 
This is not surprising, given the existence of a risk premium for the equity market.'? 
However, the overall pattern is surprising: we would expect all calls to offer a higher 
rate of return, and for this to increase as the odds lengthen, as in Figure 2. For put options 
on the stock index futures in Figure 4, the mean return tends to decrease, as the option 


13n the probability ranges from 45% to 55%, our results are similar to those of Coval and Shumway (2001). 
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is further out of the money. This is more consistent with Figure 2, but still suggests 
some anomalous behavior. For the S&P 500 put option returns, it is the in-the-money 
ones that are significant, whereas for the FTSE 100 options, only the out-of-the-money 
options are significant. In all cases, the returns on the longshot options are much more 
variable than on the favorites. Thus a much larger deviation of the sample mean from 
one is required, for a given number of observations, in order to reject the hypothesis. 


3.2. Results for Monthly Options on Stock Index Futures 


An enlargment of the data for the index options occurs when one considers options on 
futures with monthly expirations. This also allows a comparison with the three-month 
terms to expiration discussed above. The results appear in Tables 3 and 4 for the one- 
month calls and puts for the S&P 500 futures and FTSE 100 futures, respectively. 


TABLE 3 Mean Return per $1 Bet vs. Odds Levels: One-Month Options on S&P 500 Futures, 1985-2002 


Call options on the S&P 500 futures 


Put options on the S&P 500 futures 


Odds (%) 


.95-1.00 
.90-.95 
.85-.90 
.80-.85 
.75-.80 
.70-.75 
.65-.70 
.60-.65 
.55-.60 
.50-.55 
45-50 
40-45 
35.40 
30-.35 
.25~.30 
20-25 
15-.20 
.10-.15 
.05-.10 
.00-.05 


All 
options 


#Obs 


187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 


{87 


Average 


payoff 


1.0092 
0.9938 
1.0029 
0.9796 
1.0064 
0.9346 
0.9693 
0.9656 
0.9196 
0.9586 
0.8954 
0.9204 
0.9671 
0.8673 
0.9927 
0.7939 
0.9257 
0.7585 
0.6940 
0.50958 


0.9668 


SDof t-test 
payoff vs. $1 Odds (%) 
0.2506 0.50 -95-1.00 
0.3923 —-0.22 90 —.95 
0.4877 0.08 .85-.90 
0.5925 -0.47 .80-.85 
0.6762 0.13 .75—.80 
0.7612 -1.18 -70-.75 
0.8699  —0.48 65-.70 
0.9497  —0.50 .60—.65 
1.0671  —1.03 .55-.60 
1.1004  ~0.51 50-55 
1.2820 ~1.12 45-.50 
1.3652 ~0.80 40~.45 
1.5108 ~0.30 -35-.40, 
1.6712 ~1.09 -30-.35 
1.8245 ~0.05 .25—.30 
1.8764 —1.50* .20-.25 
2.5402 —0.40 .15-.20 
2.8601 —1.15 10-15 
3.5704 —1.17 05-10 
3.9119  ~-1.71*" .00-.05 
All 
2.1085 0.22 options 


#Obs 


187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 
187 


187 


Average 


payoff 


0.9792 
0.9883 
0.9989 
0.9544 
0.9880 
0.9437 
0.9520 
0.9193 
0.8867 
0.9217 
0.8146 
0.9064 
0.7672 
0.7987 
0.7454 
0.6910 
0.6101 
0.5303 
0.4039 
0.0508 


0.5033 


SD of 
payoff 
0.4949 
0.6677 
0.7746 
0.8778 
0.9814 
0.9879 
1.0734 
1.2257 
1.2654 
1.4020 
1.4427 
1.6557 
1.6412 
1.9158 
2.0390 
2.1639 
2.3346 
2.4037 
2.3630 
0.9785 


1.3827 


t-test 

vs. $1 
—1.87** 
—2.01*"" 
—1.18 
—1.67* 
-0.61 
-1.54 
—0.74 
-0.97 
—1.48* 
—0.72 
~—1.54* 
-1.17 
—2,53*** 
-1.64* 
-2,38"** 
-2.55 
-3.287 * 
—3.11**** 
—3.52"*** 
-14.49"" 


-492"""" 
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TABLE 4 Mean Return per $1 Bet vs. Odds Levels: One-Month Options on FTSE Futures, 1985-2002 
Call options on the FTSE futures Put options on the FTSE futures 
Average SDof t-test Average SDof t-test 

Odds (%) #Obs payoff payoff vs. $1 Odds (%)  #Obs payoff payoff vs. $1 
.95-1.00 123 0.9595 0.2694 —1.67* .95-1.00 123 1,0105 0.4717 0.25 
.90-.95 123 0.9719 04011 -0.78 .90-.95 123 0.9847 0.6005 —-0.28 
.85-.90 123 0.9596 0.5020 -0.89 -85-.90 123 1.0229 0.6874 0.37 
-80-.85 123 0.9474 0.6020 —-0.97 -80-.85 123 0.9235 0.7736 = -1.10 
-75-.80 123 0.9761 0.6480 —0.41 .75-.80 123 0.9760 0.9099 -0.29 
70-75 123 0.8576 0.7525 -2.10** 10-.75 123 1.0093 1.0292 0.10 
65-.70 123 0.9296 0.8458 —0.92 65-.70 123 0.9501 1.0715  —0.52 
.60-.65 123 0.8632 0.8191 = —1.85** .60-.65 123 0.8984 1.1686 -0.96 
55-60 123 0.8866 1.0456 —1.20 .55-.60 123 0.9579 1.1831 = -0.39 
50-.55 123 0.8295 0.9372 —2.02** 50-55 123 0.8033 1.2349 -1.77"* 
.45-.50 123 0.9129 1.2141 -0.80 .45-.50 123 0.8161 1.4092 -1.45* 
.40-.45 123 0.7647 1.2268  —2.13** .40-.45 123 0.9409 1.5550 —0.42 
.35-.40 123 0.7588 1.1234 —2,38*" 35-40 123 0.8699 1.6963 -0.85 
.30-.35 123 0.8685 1.6097 —-0.91 .30-.35 123 0.7072 1.7646 = —-1.84** 
.25-.30 123 0.4707  LIH9  —5.28"** .25-.30 123 0.8041 2.0297 -1.07 
.20-.25 123 0.7006 2.0045 —1.66** .20-.25 123 0.5855 2.0360 —2.26** 
.15-.20 123 0.4952 1.4297 -3,92*** .15-.20 123 0.5423 2.4428 —-2.08** 
10-.15 123 0.4779 2.4364 _ —2.38*** 10-15 123 0.5878 2.8156 -1.62* 
.05-.10 123 0.4920 3.6893 —1.53* .05-—.10 123 0.4872 3.3026  —1.72** 
.00-.05 123 0.3427 4.8288 —1.51* .00-.05 123 0,2968 3.4337 -2.27™ 
All All 
options 2.460 0.7926 2.0670 —4,98*** options 2.460 0.6535 2.4630 -6.98*** 


For both the S&P 500 and FTSE 100 option markets, the deep in-the-money one- 
month calls have a mean wealth relative close to one. The further the options are out of 
the money, the lower the mean payoff, as shown in Figures 5 and 6 using our interpola- 
tion method to give returns for odds spaced at every 1%. The pattern is quite striking for 
both markets: the payoff decays monotonically and is similar to the racetrack longshot 
bias shown in Figure 1. However, the only cases of mean returns significantly below 
one are for the FTSE 100 options, for which there is more retail activity. We have the 
usual problem of the large measurement error in expected returns measured with limited 
observations over short horizons. 

For all the put options, the pattern of mean returns for the one-month puts is 
extremely close to those found for the three-month put options.The deepest in-the- 
money puts pay on average the initial bet. Losses increase as the puts are further out 
of the money, displaying a similar longshot bias to Figure 1. 
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FIGURE 5 Mean return per dollar bet vs. odds levels: one-month stock index calls, 1985-2002. 
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FIGURE 6 Mean return per dollar bet vs. odds levels: puts on stock index futures, 1985-2002. 
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Figures 5 and 6 show the mean return for one-month options on the S&P 500 and 
FTSE 100 futures across continuous probability bandwidths. In Figure 5, S&P 500 call 
options return the $1 invested in each option on average. For the FTSE 100, the options 
tend to return slightly less than the initial investment. Therefore, there is no evidence 
of a favorite bias as the expected wealth relative is either equal to the initial investment 
or is statistically significantly below the initial investment. However, for both markets a 
significant longshot bias exists for the out-of-the-money calls in the probability ranges 
from 0% to 15% (the return is significantly less than the initial investment at a 90% 
level). However, the degree of the loss for the deep out-of-the-money calls in the 0% to 
15% range is smaller than for the three-month options seen in Tables 1 and 2. This is 
not surprising as the expected losses occur at an almost steady rate over time, and we 
have only a third of the previous time to expiration. 

The one-month put option returns appear in Figure 6. As with Figure 4 for the three- 
month put options, the mean return tends to decrease, as the option is further out of the 
money. For these options, the shape of the average return function is smoother than 
the three-month pattern. One possible explanation for this comes from Bollen and 
Whaley (2002). They indicate that the greatest concentration of trading in stock index 
put options is for put options with one month or less to expiration. Therefore, with more 
actively traded put options across the entire maturity spectrum, there is less need to 
interpolate. 


4. CONCLUSIONS 


The motivation for this research was to assess whether the favorite-longshot bias that 
has been found in horse racing and other gambling markets applies to options markets. 
The choice of stock index options was made due to a previous conjecture by Figlewski 
(1989) that deep OTM stock index call options are seen by investors as the equivalent of 
low cost/high payoff gambles and Dumas et al. (1998) that stock index put options are 
purchased at higher prices due to the need for insurance. We investigated the favorite- 
longshot bias for options on the S&P 500 Index Futures and FTSE 100 Index Futures 
for the 17+ years, March 1985 to September 2002. 

The deep OTM index call options on the S&P 500 futures and FTSE 100 futures 
have negative mean returns. During the period of 1985-2002, the mean payback from 
the purchase of three-month call options in the probability range of 0-5% was less than 
1.23 and 18.77 cents for every $1 invested in the options for the S&P 500 and FTSE 100, 
respectively. Deep in-the-money three-month calls and one-month calls on the S&P 500 
provide a mean return higher than the initial investment similar to the favorite-longshot 
bias in racetrack markets. 

For the put options on the S&P 500 and FTSE 100, we find evidence consistent 
with the hypothesis of Dumas et al. (1998) that investors pay more for puts than they 
are subsequently worth. The degree of overpaying for these options increases mono- 
tonically as the probability of finishing in the money decreases. These results are also 
consistent with Coval and Shumay (2001), for options in the similar strike price range 
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they considered in their research. We present an empirical study of these index option 
markets rather than a theoretical analysis of why the biases exist. We know that the 
put skew bias steepens when there is a large drop in the underlying index and flattens 
when prices rise and that the shape of the bias varies over time. The one-month versus 
three-month figures document this. The bias in the puts is related to portfolio insurance 
protection against downside risks. Harvey and Siddique (2000) call this skewness pref- 
erence. The deep out-of-the-money calls seem to be, as Figlewski (1989) argued, seen 
as lottery tickets which have low expected returns. The in-the-money and out-of-the- 
money three-month calls seem to reflect the equity risk premium as theoretically shown 
in Figure 2. The one-month calls in this range, which are highly price dependent on 
steep option time decay, are fairly priced for the S&P 500 and negatively priced for the 
FTSE 100. Investors’ aversion to downside risk shown in overpriced puts, especially fol- 
lowing the 1987 crash, is consistent with our data; see also Rubinstein (1994) and Bollen 
and Whaley (2002). This is similar to the pattern observed for the favorite-longshot bias, 
and is the expected cost of insurance. 

The month call options on the S&P 500 and the FTSE 100 have similar patterns, 
but with magnitudes closer to one. Only for the in-the-money calls on the S&P 500 is 
a favorite bias found. The deep in-the-money calls on the FTSE 100 pay an average 
return very close to the intial bet. For the out-of-the-money options, there is a reduction 
in the expected return (like a longshot bias). However, this is not as extreme as for the 
three-month options, and only statistically significant for the FTSE 100 options. 
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Abstract 


This chapter surveys the dosage breeding theory pioneered by Vuilliers, Varola, and 
Roman with major emphasis on two top classic three-year-old thoroughbred races, 
namely, the Kentucky Derby and the Belmont Stakes. Run at 1'4 mi and 1% mi respec- 
tively, they typically are at least 4% and ‘4 mi longer than any of the horses has ever 
raced before. This extra distance, combined with the large fields (especially in the 
Derby), make these two races a difficult test of stamina for horses this young. Bet- 
tors are also challenged because there is no direct evidence of whether a horse has 
the stamina to compete effectively at these distances. The informational content of the 
publicly available, pedigree-based measure of stamina, the Dosage Index, is used with 
simple performance measures to identify a semi-strong-form inefficiency. Statistically 
significant profits, net of transaction costs, could have been achieved during 1946-2006. 
This can be compared to the middle leg of the Triple Crown, the Preakness, run at '%., 
where the Dosage Index provided no advantage. 


JEL Classifications: G10, G14 


Keywords: semi-strong-form market efficiency, capital growth theory, speculative investments, 
sports betting 


1. INTRODUCTION 


The Kentucky Derby annually gathers many of the top three-year-old thoroughbred 
horses at Churchill Downs in Louisville, Kentucky on the first Saturday in May. For the 
horses entered, the race is a new challenge, since its distance of 1 mi is typically at least 
'4 mi longer than any of them has ever raced. The extra distance of the Kentucky Derby, 
usually combined with a large field that includes many top-flight contenders, presents a 
significant test of stamina for these young horses. Two weeks after the Kentucky Derby, 
many of the same horses plus others compete in the Preakness Stakes run at the shorter 
distance of '%,mi at Pimlico racetrack in Baltimore, Maryland. Then three weeks later, 
in early June, the 1⁄4 mi Belmont stakes is held at Belmont Park on Long Island, near 
New York City. And like the Kentucky Derby, typically none of the Belmont Stakes 
entrants has run this far this early in their careers. 

Since the horses in the Kentucky Derby and the Belmont are running at longer dis- 
tances than any earlier races, the public’s assessment of their stamina cannot be easily 
based on their past performances. Without direct evidence of a horse’s ability to run at 
the distances of these two races, bettors have looked to indirect evidence. One approach 
to assessing stamina that has received wide public attention looks at whether the sires in 
a horse’s pedigree have demonstrated a pattern of progeny with stamina. This approach, 
called dosage theory and described in Section 3, is coupled with evidence of success 
in major races as a two-year-old horse to study semi-strong-form efficiency of the 
Kentucky Derby by Bain, Hausch, and Ziemba (Bain et al., 2006; hereafter BHZ), and 
of the Belmont Stakes by Gramm and Ziemba (2007; hereafter GZ). 
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We describe the empirical analyses in BHZ and GZ, and summarize their results 
that dosage theory and success as a two-year-old horse point to semi-strong-form 
inefficiency. Two categories of horses are studied: dual qualifiers and asterisk qualifiers. 

Dual qualifiers are those with a speed/stamina balance consistent with the Derby 
and Belmont characteristics and who were strong enough as two-year-olds to be ranked 
within 101b of the top two-year-old. Asterisk qualifiers pass the dosage test but not 
the 101b test. However, they showed strength by winning a major race early in their 
three-year-old year. 

BHZ and GZ show that both dual qualifiers and asterisk qualifiers did in fact provide 
positive wagering profits in the Derby and even more so in the Belmont. Since the 
dosage theory is silent on the Preakness, these strong positive results do not apply there. 
The Kentucky Derby results are strongest for the pre-1996 period while the Belmont 
results are strong through 2006. There were no dual or asterisk qualifiers in the 2007 
Belmont. 

Roberts (1967) defined a market as being weak-form, semi-strong-form, or strong- 
form efficient if it is not possible to devise a profitable investment scheme net of 
transactions costs based on prices (or, for the racetrack, publicly available odds), based 
on all publicly available information, or based on all information, respectively. For tra- 
ditional financial markets, there is considerable evidence that points to weak-form and 
semi-strong-form efficiency, but little evidence for strong-form efficiency (see Fama, 
1970, 1991; and Keim and Ziemba, 2000, for surveys). 

Weak-form efficiency of the racetrack’s win market means that betting systems 
based solely on the public’s win odds, established through pari-mutuel betting, are 
not profitable. Evidence from many tracks over many years has pointed to weak-form 
efficiency, for example, Ali (1977) and Asch et al. (1982).! Weak-form efficiency of 
the win-betting market is a consequence of four of its features. First, transaction costs 
are high, about 13-20%, depending on track location, so a bettor needs to be con- 
siderably more successful than the average bettor just to break even.? Second, while 
the challenge is substantial, the concept of the win bet is relatively simple. Thus, 
bettors have no confusion about their task. Third, many racetrack bettors approach 
their wagering very seriously and some are very sophisticated. Fourth, for this serious 
audience, there is usually an abundance of relevant information, including records of 
past performances and workouts for all the horses, breeding, earnings, jockey records, 
and so on. 


"An exception may be extreme favorites at odds of 3/10 or less, which have been shown historically to pro- 
duce a small average profit; see Ziemba and Hausch (1986). However, data such as that shown in Ziemba 
(2008) and in Snowberg and Wolfers (2008) indicates that such positive profits currently do not exist with- 
out rebates. Inefficiencies in other more complex markets are more common; see Hausch et al. (1994) 
and other chapters in this volume for such evidence. Weak and semi-strong market efficiencies are dis- 
cussed in the chapters by Hausch and Ziemba (2008) and by Johnson and Sung (2008), respectively, in this 
volume. 

*Large bettors can reduce this take by betting at rebate sites that return a portion of the bet to make the actual 
take about 10%. We do not deal with such bettors here nor with those outside the U.S, who wager on Betfair 
or other betting exchanges against other bettors directly rather than in a pari-mutuel pool as discussed here. 
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For the Kentucky Derby and the Belmont Stakes, the first two of these criteria are 
satisfied for the win market. While the third criterion is met, the Kentucky Derby and 
Belmont Stakes also receive much more interest in North America from casual fans 
than other races. Because, typically, none of the Derby or Belmont Stakes entrants has 
raced at 1'4mi and 1'4 mi, respectively, it can be argued that the fourth criterion is not 
fully met. 

The objective in BHZ and GZ is to determine whether the informational content of 
the Dosage Index, a pedigree-based measure of stamina that is publicly available, in 
conjunction with simple performance measures, is captured in the pari-mutuel win odds 
and, if not, whether it can be used to develop a profitable betting scheme. 

The operation of the racetrack market is discussed in the first section. Section 3 
describes the Dosage Index and performance measures, and their application to the 
Kentucky Derby and Belmont Stakes. The data used by BHZ and GZ in their analy- 
ses are discussed in Section 4. Section 5 describes BHZ’s scheme for estimating each 
betting interest’s win probability based on the public’s odds, the Dosage Index, and the 
performance measures, with an application of this technique to the Kentucky Derby. 
The Kelly capital growth betting model is described in Section 6. The results from 
BHZ2’s joint application of their probability estimation scheme and Kelly wagering for 
the Kentucky Derby appear in Section 7. Sections 8 and 9 discuss GZ’s results for the 
Preakness and Belmont Stakes for 1946-2007. Conclusions appear in Section 10. 


2. THE RACETRACK AS A SEQUENCE OF MARKETS 


Prior to a race, bettors engage in markets that establish prices for the various betting 
opportunities for that race. Betting closes immediately before the race begins, and 
payouts are calculated immediately following the race. For win betting, there are N 
betting interests in a race. Let W; be the total amount bet to win on betting interest 
i=1,...,N. The win pool is 


w=), (1) 


The track payback, Q (generally 0.80 to 0.87 for win bets), is the fraction of each dollar 
bet that is returned to the bettors. The commission, or track take, is 1 — Q. If betting 
interest k wins the race, then win bets on betting interests i # k return zero, while each 
dollar bet on betting interest k returns approximately QW/W,. The actual profit per 
dollar is rounded down to the nearest nickel or dime (this is called breakage). Together 
the track take and breakage constitute the transaction costs.* 


JOur notation deals with one race only. In Section 5, to deal with several races simultancously, we add a 
superscript to our notation to identify the race number. 

4For cach track there is a minimum payout, usually 5%, that the track must return even if there are insufficient 
funds available in QW. If there is a rebate, then the track take is effectively reduced to about 10% so Q is 
about 0.90. 
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Typically, each horse in a race runs as a separate betting interest. However, two or 
more horses in a race that have common ownership typically run as a single betting 
interest known as an entry. In addition, in a race where there would be more betting 
interests than a preset maximum, the horses with the least-impressive credentials are 
grouped as a single betting interest known as the field. A bet to win on an entry or 
the field pays off if any member of that betting interest wins the race. Both have been 
common in the Kentucky Derby but not in the Belmont Stakes. However, the long-time 
restrictions in Kentucky changed in 2001, so there were no entries and no field in the 
Kentucky Derby from 2001 on. 


3. THE DOSAGE INDEX AND PERFORMANCE MEASURES 


The fact that usually no Derby (or Belmont) entrant has raced at 1'4 (1'4) mi prior to the 
race has led to the search for relevant information from alternative sources, including 
the horse’s pedigree. One method of evaluating a thoroughbred’s pedigree, commonly 
known as dosage theory, has its roots in the work of French cavalry officer Lt. Col. 
Jean-Joseph Vuillier, who studied the pedigrees of exceptional thoroughbreds of the late 
nineteenth and early twentieth centuries; see Vuillier (1902, 1906, 1928). The concept 
of thoroughbred dosage evolved through Varola, who developed a patented classifica- 
tion of prominent stallions according to the type of offspring that they produced in a 
series of articles in The British Racehorse; see also Varola (1974, 1980). 

Roman’s (1981) modifications of Varola’s work are known as dosage theory. His 
work was outlined in Leon Rasmussen’s Bloodlines column in the Daily Racing Form 
beginning before the 1981 Kentucky Derby. One product of Roman’s pedigree analysis 
is the Dosage Index (DI), which is based on the categorization of prominent stallions 
in terms of whether they consistently sire offspring with distance proficiencies that are 
incongruous with the dosage profiles of those offspring when that stallion is excluded. 
Classified stallions are called chefs-de-race (or simply chefs);> see Ziemba and Hausch 
(1987) and Roman’s Website (http://www.chef-de-race.com) for the rationale behind 
the selection of recent chefs, and Roman (2002). 

There are five categories for chef classification in Roman’s system: Brilliant, Inter- 
mediate, Classic, Solid, and Professional. The categorization is based on “where they 
[sires] must lie on the speed-stamina spectrum to bring the figures of their descendants 
back in line with those of horses in the general population exhibiting similar perfor- 
mance traits” (Roman, 2001). A chef can be placed in one or two categories. Each time 
a chef appears in a four-generation pedigree, points are awarded in the appropriate cat- 
egory. Points are assigned on a scale of 16 for the first-generation sire, eight for each 
second-generation sire, four for each third-generation sire, and two for each fourth- 
generation sire. Sires that are classified in two categories have their points split. After 


5Mares are not included because they are considered to have too few offspring to identify distance 
proficiencies, while it is not unusual for a stallion to sire 100-200 offspring in a year. 
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the 15 sires have been assigned points, the total for each category is entered into the 
Dosage Index formula 


_ Brilliant + Intermediate + 1/2 Classic 


Solid + Professional + 1/2 Classic (2) 

Horses with a high DI have a pedigree that is weighted towards Brilliant and Inter- 
mediate chefs, that is, sires who tend to produce offspring with greater sprinting ability 
than their pedigrees would suggest if that sire were eliminated from the pedigree. Horses 
with a low DI are predicted to have stamina. Very seldom will a stakes-quality horse 
have no dosage points, though some have so few that the D/ is unreliable. The pedi- 
gree and the dosage profile for 2005 Belmont Stakes winner Afleet Alex are shown in 
Table 1. Each pedigree shows the sire and mare for each horse for four generations. For 
example, Afleet Alex’s sire and mare were Afleet and Nurvette, with their respective 
sires and dams shown directly to their right in the pedigree. 

After the initial classification of chefs in 1981, Roman found that no Kentucky Derby 
winner from 1940 to 1980 had a DI exceeding 4.0, despite about one in seven entrants 
having a DJ that high. 

The Dosage Index is not a direct measure of the quality of a horse. One 
quality measure is the experimental free handicap (EFH), an annual ranking of 
two-year-old thoroughbreds that raced in select races in the United States. (see 


TABLE1 Pedigree and Dosage Index Calculation for 2005 
Belmont Stakes Winner Afleet Alex 


Mr. Prospector Raise a Native (B) 
(B/C) Gold Digger 
Venetian Jester 
Polite Lady 
Friendly Ways 
Northern Afleet 
Northern Dancer (B/C) 
Nureyev (C) 
Special 
Nuryette = 
Tentam 
Stellarette 
Square Angel 
[ Roberto (C) 
Silver Hawk 
Gris Vitesse 
Hawkster 
Chieftain 
Strait Lane 
M Hawk Level Sands 
a: aw 
e5 Utrillo II 
Hawaii 
Ethane 
Qualique ko 
Sensitivo 
Dorothy Gaylord 
Gaylord’s Touch 
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Generation Sire Brilliant Intermediate Classic Solid Professional 

1 Northern Afleet 
2 Afleet 

Hawkster 
3 Mr. Prospector 2 

Nureyev 4 

Silver Hawk 
Hawaii 

4 Raise a Native 2 


Venetian Jester 

Northern Dancer 1 i] 
Tentam 
Roberto 2 
Chieftain 
Utrillo I 
Sensitivo 

Total 5 0 9 0 0 


NOTE: Dosage Index = (5 + 0 + 9/2)/(0 +0 + 9/2) = 2.11. 


http://www.jockyclub.com/experimental.asp). Conducted since 1933 by the Jockey 
Club, the EFH assigns the top runners a figurative weight on a scale that usually has 
the two-year-old champion weighted at 126 lb.° Exceptional horses have been weighted 
up to 1301b. Other top horses are assigned lower weights based on perceived ability 
until a cutoff is reached at about 1001b beyond which no more horses are classified. 
Usually there are 15 to 30 horses classified within 101b of the top-weighted horse. 
Roman (1981) observed that starting in 1972, most Kentucky Derby winners were rated 
within 10 1b of the top-weighted horse. This observation led to the designation “dual 
qualifier” for any horse that was weighted within 10 lb of the top-weighted horse on the 
EFH (indicating the quality of the horse) and had a DJ less than or equal to 4.0.’ 

Professional handicapper James Quinn offered a second measure of quality to add 
late-developers to the list. He defined what we call an “asterisk qualifier” to be any 
horse that: (1) won at least one of a selection of premier races prior to the Kentucky 
Derby or Belmont Stakes; (2) had a D7 less than or equal to 4.0; and (3) was not rated 
within 10 Ib on the EFH. A horse is a “dual-or-asterisk qualifier” if it qualifies for one 
of these two categories. 


Ranking horses by weight is a familiar concept at the racetrack. In handicap races, the top horses carry greater 
weight (jockey + saddle + additional weights if necessary) than the less-qualified horses. Handicapping of 
this sort occurs only in select races and is intended to make the race more competitive. 

7Some people expand the dual qualifier category to include any horse that is declared a champion in a country 
other than the U.S. and has a DZ less than or equal to 4.0. In this chapter, only the first definition was used. 
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The objective of BHZ and GZ was to study whether these widely publicized mea- 
sures have any predictive power that is not incorporated into the public’s pari-mutuel 
win odds. If not fully incorporated, then a further objective was to investigate whether 
these measures could be used to determine win probability estimates that are sufficiently 
superior to the public’s so that a profitable wagering scheme based on win betting could 
be developed, despite the significant transaction costs. 


4. DATA ACQUISITION 


This section discusses the nature of the data in BHZ and GZ, while the sources of their 
data are described in the Appendix. 

The public’s win betting pool and results were collected for the period 1946 to 2007. 
For 54 of these years, dollar amounts that the public wagered were found which yielded 
precise values for q;. For the other eight years, only the final win odds for each betting 
interest were available. In these cases it was possible to back out win probabilities that 
were consistent with these odds. 

The EFH listing and pedigree information for each Derby participant were collected 
for each year from 1946 to 2007. The original list of chefs was published in 1981. For 
years prior to 1981, this list was used, which means that the classification of chefs for 
1946 to 1980 is not completely out of sample. The Kentucky Derby hypothetical betting 
begins in 1981, so all betting is based on lists of chefs that were out of sample. For the 
period 1981-1986, the 1981 list was used (see Appendix for explanation). After 1986, 
an updated list of chefs was used each year. The Belmont calculations are for 1981-2006 
and 1946-2006, with no dual or asterisk qualifiers in 2007. 

The major races for asterisk-qualifier status, with their 2007 graded stakes clas- 
sification and the years that they have been run over the interval 1946-2005, were 
the Blue Grass Stakes (G1; 1946-2005), the Flamingo Stakes (currently not run; 
1946-1989, 1992-2001), the Florida Derby (G1; 1952-2005), the Santa Anita Derby 
(G1; 1946-2005), and the Wood Memorial Stakes (G1; 1946-2005). The Flamingo 
Stakes declined in importance before being cancelled, but was included because 
historically it was an important prep race. 


5. APPLICATION OF BREEDING INFORMATION AND 
PERFORMANCE MEASURES TO REFINE ESTIMATED 
WIN PROBABILITIES FOR THE KENTUCKY DERBY 


BHZ developed two models for estimating win probabilities that depended on whether 
a betting interest was a dual qualifier or a dual-or-asterisk qualifier. 

The 1995 Kentucky Derby is used in Table 2 to illustrate the required information. 
Also evident is a complication with regard to accounting for pedigree with entries (and 
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TABLE 2 Sample Input Data: 1995 Kentucky Derby Field 


Qualifier status 


Horse Wi/W Entry EFH Won specified DI Dual Asterisk 
Jambalaya Jazz 0.044 1 115 1.15 

Pyramid Peak — Flamingo 3.00 . 
Serena’s Song 0.189 2 122 2.11 e 

Timber Country 126 3.29 e 

Mecke 107 4.50 

Knockadoon — 3.57 

Citadeed 0.066 Field — 1.60 

In Character — 1.77 

Ski Captain — 3.67 

Lake George — 4,50 

Thunder Gulch 0.033 116 Florida Derby 4.00 e 

Tejano Run 0.087 121 2.38 e 

Jumron 0.126 115 3.80 

Eltish 0.070 123 3.00 ° 
Afternoon Deelites 0.086 124 5.00 

Suave Prospect 0.059 113 4.60 

Talkin Man 0.167 114 Wood Memorial 3.00 ° 
Dazzling Falls 0.029 111 6.20 

Wild Syn 0.042 — Blue Grass 4.33 


NOTE: W;/W: Post-time fraction of win pool. Entry: Entry number or field. EFH: Experimental 
free handicap weight. Blank implies not weighted. (High weight for two-year-olds from 1994 was 
126 tb.) Won specified race: e Winner of a major race prior to Kentucky Derby. D1: Dosage Index: see 
Equation (2). Qualifier status: e Implies meets qualifier requirements. Source: Bain et al., 2006. 


the field): the horses in an entry may not have the same qualifier status. This difficulty 
was handled using the following scheme: 


l. 


2. 


If all members of an entry had the same qualifier status, then the entry was 
considered as one horse with that qualification. 

If one member of the entry was a dual qualifier plus had won any of the desig- 
nated major races prior to the Kentucky Derby, the entry was considered to be a 
dual qualifier regardless of the qualifications of the other member(s) (based on 
the presumption that in most cases most of the public’s attention on the entry was 
due to that horse). 


. If the members of an entry did not all have the same qualifier status, but each 


was either a dual qualifier or an asterisk qualifier, then the entry was viewed as a 
dual-or-asterisk qualifier. 


. Inall other cases the entry was considered to be neutral, that is, neither an asterisk 


qualifier nor a dual-or-asterisk qualifier. 
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The qualifier status of the field was determined in the same manner. For the dual- 
qualifier model there are 67, 0, 10, and 22 betting interests in the respective categories, 
and for the dual-or-asterisk-qualifier model there are 57, 2, 10 and 30 betting interests 
in the respective categories. 

With respect to the dual-qualifier model, of the winners there are 29 that are quali- 
fiers, 26 that are not qualifiers, and 3 that are part of a neutral entry. With respect to the 
dual-or-asterisk-qualifier model, of the winners there are 41 qualifiers, 16 that are not 
qualifiers, and 3 that are part of a neutral entry. In 1998 and 2003 there were no dual 
qualifiers, so those years were ignored in the dual-qualifier modeling. 

The base-case model relates a betting interest’s win probability to the public’s wager- 
ing to see if looking solely at the pools without the “expert information” could lead to a 
profitable betting scheme. 

Let W;’ be the public’s win bet on betting interest i, W/ be the win pool in race j, and 
N/ be the number of betting interests in race j. For race j, define p to be the probability 
that betting interest i wins and define q} = W7 /W/ to be the fraction of the win pool 
bet on betting interest i. For this base case, the following model was used for each race 


j CO 


Pi = o R (3) 
2 (am) 


If ò = 1, then p; = q}. 

BHZ used a standard maximum-likelihood approach to estimate optimal values of 
ò. Consider R independent races and define K = (k1,..., kpg) to be an R-tuple repre- 
senting the winners of the R races, that is, k; is the number of the betting interest that 
won race j. Let Pr, represent the estimated probability based on Equation (3), evalu- 
ated before race j, that betting interest k; wins race j. The probability that the vector 
K corresponds to the winners of the R races is 


R 
PK) =|] Pez (4) 


j=l 


Treating Equation (4) as a likelihood function that depends on 8 gives 


R 
elk [fo (5) 


j=l 


A maximum-likelihood point estimate for 5, namely dy, can be found by maximizing 
the likelihood as a function of 6. The first value for m, was calculated using the first 10 
years of data, namely 1946-1955 inclusive. Thereafter, the value of m, was updated 
for each year using data from 1946 to that year. The win pool fraction for the winner 
and values for ôm, calculated after each year’s race are shown in Figure 1. 
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FIGURE 1 Value for òp after each year’s race and the fraction of the win pool bet on the winner, 
1955-2005. Source: Bain et al., 2006. 


The values for òm are less than 1.0 for the years prior to 1974. This is a conse- 
quence of the public’s more favored betting interests winning less often during this 
period than would have been expected based on the public’s odds. The winners from 
1972 to 1979 were dominated by favorites, culminating with Spectacular Bid in 1979, so 
dmx increases over this interval, reaching a maximum value of 1.12. During the period 
of 1980-2005, the public’s favorite seldom won, so mı tends to decrease to it final 
value 0.92.8 

With values of òm, close to 1.0, Equation (3) generates revised win probabilities that 
differ only slightly from the fraction of the win pool. The greatest ratio p;/q; over the 
interval 1981-2005 is 1.12. This 12% edge is insufficient to offer a positive expected 


8Griffith (1949), McGlothlin (1956), Ali (1977), Asch et al. (1982), and Ziemba and Hausch (1986), among 
others, have demonstrated that the public’s wagering has a strong and stable bias of underbetting the favorites 
and overbetting the longshots. This results in è > 1.0. Ziemba and Hausch (1987) provided evidence that 
this “favorite-longshot bias” is exhibited at the Kentucky Derby but it is weaker, that is more flat, than in 
these earlier studies. The recent advent of rebate and betting exchange wagering has led to a flattening of the 
favorite-longshot bias in recent data since about 1998; see Ziemba (2004, 2008) and Snowberg and Wolfers 
(2008). 
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return on a win bet after accounting for the transaction costs; hence this simple model 
points to weak-form efficiency of the win-betting market over this period. 

The main objective of BHZ was to use this procedure to create models that modified 
the win probability for each betting interest based on whether or not it was considered 
a dual qualifier or considered a dual-or-asterisk qualifier. 

For this case, BHZ viewed the probability of betting interest i winning to be 


. CAN 
Pi = ———., Yin Ym =, B, or 1. (6) 


No 
x (Gn) 


The variable y; = a if betting interest i was a dual qualifier (or dual-or-asterisk qualifier 
depending on the test), y; = 8 if it was classified as not a dual qualifier (or not a dual- 
or-asterisk qualifier if applicable), and y; = 1 if the betting interest was an entry or field 
classified as being neutral. 

Based on Equation (6), BHZ calculated annual maximum-likelihood values for œ 
and B, denoted as am, and Buz, were calculated each year. The initial estimate for 
1956 used the first 10 years of data (1946-1955). Figure 2 illustrates the progression 
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FIGURE 2 Values for æm, and Bmx for dual-qualifier model after each year’s race, 1955-2005. 
NOTE: In the figure, Qua = dual qualifier; Neu = member of a neutral entry; and Non = non-qualifier. 
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FIGURE 3 Value for amu and Bz for dual-or-asterisk qualifier model after each year’s race, 1955-2005. 
NOTE: In the figure, Qua = dual qualifier; Neu = member of a neutral entry; and Non = non-qualifier. 
Source: Bain et al., 2006. 


of am and Bur values for the dual-qualifier model, and Figure 3 shows amy and BML 
values for the dual-or-asterisk-qualifier model. 

In Figures 2 and 3, the critical pattern is the relative magnitude of amı and Ba. 
In Figure 2, for nearly 20 years, am, exceeds By. Consequently, for this period, the 
revised win probability for each dual qualifier is less than the fraction of the money bet 
on it in the win pool. This implies that betting on dual qualifiers, if they had been known, 
would not have been advantageous during that period. In the mid-1970s, dual qualifiers 
began to win consistently, eventually leading to Bu exceeding amr, for the remainder 
of the study period. For this later period, the revised win probabilities for dual qualifiers 
exceed their fraction of the win pool. Figure 3 for dual-or-asterisk qualifiers shows a 
similar pattern, although By begins to exceed amy after only three years. Thus, after 
the third year, the model predicts win probabilities for dual-or-asterisk qualifiers that 
exceed their fraction of the win pool. 

The original and revised estimates of win probabilities for 1995 are in Table 3, where 
a betting interest’s estimated win probability rises if it meets the qualifier criterion. 
This is not necessarily the case if there are many qualifiers because the sum of the 
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TABLE 3 Original and Revised Estimated Win Probabilities 
for the 1995 Kentucky Derby 


Betting interest W/W DQ PDQ DAQ  ppag 
Entry 1 0.044 -1 0.029 0 0.027 
Entry 2 0.189 1 0.242 1 0.220 
Field 0.066 ~1 0.044 -1 0.035 
Thunder Guich 0.033 1 0.075 1 0.076 
Tejano Run 0.087 1 0.144 1 0.137 
Jumron 0.126 -1 0.085 -1 0.068 
Eltish 0.070 1 0.125 1 0.121 
Afternoon Deelites 0.086 -1 0.057 -1 0.046 
Suave Prospect 0.059 -1 0.039 -1 0.031 
Talkin Man 0.167 ~1 0114 1 0.204 
Dazzling Falls 0.029 -1 0.019 -l1 0.015 
Wild Syn 0.042 -1 0.027 -l1 0.022 


NOTE: W;/W: fraction of win pool bet on the betting interest. DQ: 
Indicator is 1 for dual qualifiers, 0 for unclassified entries, and —1 for 
non-qualifiers. ppg: Revised estimated win probability based on dual- 
qualifier model. DAQ: Indicator is 1 for dual-or-asterisk qualifiers, 0 for 
unclassified entries, and —1 for non-qualifiers. ppag: Revised estimated 
win probability based on dual-or-asterisk qualifier model. Source: Bain 
et al., 2006. 


probabilities is unity. The effect of considering asterisk qualifiers is demonstrated by 
Talkin Man, an asterisk qualifier but not a dual qualifier. Talkin Man’s estimated win 
probability varies from 0.114 to 0.204 depending on the criterion used. 

The revised win probability estimates are occasionally sufficiently greater than the 
fraction of the win pool to allow a positive expected return even considering transaction 
costs. 

The percent increase in the win probability over the fraction of the win pool for 
Thunder Gulch is much greater than for Entry 2. There is a general tendency for p;/q; 
to increase for qualifiers as q; decreases, which is a consequence of the power function 
model in Equation (6) together with values of œ < B anda < 1. 


6. THE KELLY BETTING MODEL 


BHZ calculated betting amounts using the Kelly-optimal capital growth model which 
maximizes the expected logarithm of wealth on a race-by-race basis. This approach 
was proposed by Kelly (1956), and was extended and rigorously proved by Breiman 
(1961) and Algoet and Cover (1988). Among its properties are: (1) it maximizes the 
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asymptotic growth rate of wealth; (2) it asymptotically minimizes the expected time to 
reach any specific sufficiently large wealth level; and (3) in the long run, it outperforms 
any other essentially different betting strategy almost surely and asymptotically provides 
infinitely more final wealth than any other essentially different strategy. (See MacLean 
et al., 1992, 2006; and Thorp, 2006, for further properties, and Ziemba and Hausch, 
1986, for simulation results for shorter time horizons.) 

The revised probability of betting interest i winning based on dual-qualifier or dual- 
or-asterisk qualifier status is p;. Let r; be the gross return per dollar bet based on the 
win odds established by the public. (As in Section 1, the superscript indicating the race 
number is suppressed.) The model requires solving the following optimization problem 
for each race: 


N N N 
maximize F nos(1-F tsr) st fi20Yi=1,...,N and Ñ fiS 1. 


m=1 i=l 


(7) 


The decision variable, f;, is the fraction of the current wealth to bet on betting interest 
i. Suppose that the bettor’s initial wealth is w and betting interest i wins. Then w f;r; is 
returned to the bettor after having invested 


N 
wÈ, Sus 
m=) 
for a final wealth of 


N 
v(t Dae sin), 


m=} 


The objective function is the logarithm of final wealth for each betting interest winning, 
weighted by the probability of that betting interest winning. Initial wealth, w, can be 
disregarded in the formulation with the non-negative decision variables the fraction of 
wealth that is bet on each betting interest. The constraints comprise a budget constraint. 

This formulation assumes that the bets are sufficiently small so that they do not influ- 
ence the payout on any betting interest, that is, the bets on betting interest i do not reduce 
ri. For the Kentucky Derby and the Belmont Stakes, the win betting pools are so large 
that a typical bet is unlikely to influence the payouts.° The large pools also permit this 
assumption because the percent bet on each betting interest is assumed to vary little in 
the final few minutes. 


°See Hausch et al. (1981) for a formulation that does account for the bettor’s effect on payouts. 


538 


Calendar Anomalies and Arbitrage 


322 


Chapter 15 e Dosage Breeding Theory 


7. THE KENTUCKY DERBY, 1981-2006 


In BHZ’s study of semi-strong efficiency of the Kentucky Derby win market, they 
started by revising win probabilities using the base-case model [Equations (3-5)]. 
The revisions were sufficiently close to the public’s win probabilities that expected 
returns were negative for all betting interests during the period 1981-2005. Thus, 
the optimization problem in Equation (7) with these revised probabilities led to no 
wagers. 

For the models based on Equation (6) and on status as a dual or dual-or-asterisk 
qualifier, BHZ started with an initial wealth of $2,500 in 1981. Wealth was updated 
each year based on the bets made and the actual outcome of the race. The wealth history 
for betting from 1981 to 2005 is shown in Figure 4 for dual qualifiers and for dual-or- 
asterisk qualifiers. The overall results are summarized in Table 4. For the years up to 
the mid-1980s, any advantage identified by the model is small and results in small bets. 
Comparing the values in Figure 4 to those in Figures 2 and 3 shows that ayy and By. 
are relatively close over that interval. As the model predicts a greater advantage, the 
amount per bet, and consequently the volatility, grows. 
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FIGURE 4 Wealth level history for Kelly win bets, 1981-2005. Source: Bain et al., 2006. 
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TABLE 4 Profits from Both Models Based on Kelly Bets 
and Revised Win Probabilities 


Model based on qualifier type 


Dual Dual-or-asterisk 
Number of bets 61 107 
Total amount bet $32,828 $66,467 
Number of bet that won 9 14 
Initial wealth $2,500 $2,500 
Final wealth $5,514 $4,889 
Total profit $3,014 $2,389 
Percent return on investment 9.2 3.6 


Source: Bain et al., 2006. 


For comparison, betting $2,500 on the favorite to win from 1981 to 2005 would 
yield a loss of $41,500, betting $200 to win on each dual qualifier would yield a profit 
of $12,920 on $13,200 bet, and a $200 bet to win on each dual-or-asterisk qualifier 
would yield a profit of $7,780 on $23,000 bet (neutral entries excluded as qualifiers). 
The improved return on investment compared to the OCGM in the short run is mostly 
due to a few huge profits on qualifiers Gato del Sol in 1982, Ferdinand in 1986, and 
Thunder Gulch in 1995. 

For both models the betting scheme produced profits during the 1980s and up to 
the mid-1990s, but the performance has been poor since. Several possibilities can be 
considered for this: 


1. The sample size is small. This could mean that the sequence of successes for 
qualifiers for both models from 1972 to 1997 was a short-term run, so that in the 
long run there is nothing to be gained from using either model developed here. 
The limited sample size also implies that the final wealth is sensitive to individual 
race results. To give an idea of the scope, two extreme examples are (i) if a non- 
dual-qualifier had won in 1995, instead of Thunder Gulch, the final wealth for the 
dual-qualifier model would be $2,221, that is, a slight loss overall, and (ii) if 2005 
winner Giacomo had a favorable change of a single dosage point in any category, 
Giacomo would have been a dual qualifier and the final wealth would have been 
$13,877. 

2. It is extremely difficult to make a proper assessment of all of the two-year-olds, 
so the EFH can omit suitable horses. A pointed example occurred in 2003 where 
eventual 2004 Derby winner Smarty Jones was not rated on the EFH. Yet, he 
had overwhelmed a field of state-bred two-year-olds at Philadelphia Park, but 
that race is not counted when determining the EFH. Roman (2005a) lists other 
Derby winners such as Winning Colors and Sunday Silence who were superior 
two-year-olds but were not rated on the EFH. 
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3. Classification of chefs is an ongoing exercise. For example, Alydar was classified 
as a chef subsequent to Strike The Gold winning the Derby in 1991, so Strike The 
Gold, who won the Blue Grass Stakes, is not considered as an asterisk qualifier 
here (DJ = 9.00), yet when Alydar was classified, Strike The Gold’s DI was 
reduced to 2.60. Ziemba (1991) wrote a column about this on April 28, 1991, 
one week prior to the 1991 Derby arguing that Alydar should be a classic chef 
as Alydar had numerous classic distance winners. The reclassification of Alydar, 
and at what point, would possibly change other pedigrees. However, BHZ and 
GZ use Roman’s classification, so Strike the Gold is neither a dual nor an asterisk 
qualifier. 

4. Roman (2005b) has pointed out the gradual rise in the DI of Derby winners over 
time, so the failure of the system in the last few years of the study may reflect 
a shift of the overall breed in North America toward speed at shorter distances. 
Real Quiet in 1998, Charismatic in 1999, and Giacomo in 2005 all had DJ values 
greater than 4.00. 

5. The Flamingo Stakes decreased in significance in the final years that it was run. 
Including the Flamingo winner as an asterisk qualifier in recent years was unwar- 
ranted in retrospect. Two possible solutions are to drop the Flamingo at some 
point in the analysis, or switch to the Arkansas Derby as the fifth significant 
prep race. 


Random betting generates expected losses in excess of 16%, due to the 16% track 
take plus breakage. BHZ showed both qualifier designations approximately doubling 
wealth over the betting period. They also used two approaches to address the statistical 
significance of these profits. The first approach treats a betting interest’s win or loss as a 
binomial random variable and then uses a normal approximation. The second simulates 
the set of races assuming random wagering. 

Before considering their first approach, observe that the data in Figure 4 are not 
ideal for addressing statistical significance. Wealth generally grows until the mid- 
to-late 1990s, and then dramatically falls. This pattern of wins and losses leads to 
wealth that is highly variable. Focusing on just the dual-qualifier case, Figure 5 super- 
imposes on Figure 4 the wealth level history assuming that the races were run in 
reverse order, that is, we started in 2005 with $2,500, then updated wealth based 
on our results in 2005 and went to the 2004 race, and so on. Thus, the string of 
large losses occurs early with lower wealth, after which wins are more common. 
The final wealth is identical, since the optimal capital growth system simply deter- 
mines the optimal fraction of wealth to bet each year. (For example, losing 10% one 
year and gaining 20% the other year leads to an overall return of 8%, whatever the 
order of the win and the loss.) Despite the final wealth being the same, the wealth 
histories are very different, as is the appearance of any statistical significance to the 
profits. 

To eliminate the effect of varying wealth, which can dampen or intensify the vari- 
ance in profits, BHZ’s test of statistical significance uses bets and returns in each race 
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FIGURE 5 Betting wealth for Kelly win bets on dual qualifiers with races run forward (1981-2005) and 
run backward (2005-1981). Source: Bain et al., 2006. 


assuming an identical initial wealth each race. They do not update wealth year by year 
(as in Figure 4); instead the initial wealth each year is assumed to be $2,500. 

Let q be the probability of winning a bet in each trial, n be the number of trials, 
c be the amount wagered each trial, and r be the gross return upon winning,'® and let 
X be the random variable representing the number of wins. The probability of profits 
exceeding a constant 7 is 


P[rX -ne >]. (8) 


Assume that the trials are independent. For bets in different races, this assumption is 
reasonable. For multiple bets on the same race—which are common—wins are nega- 
tively correlated, since if one betting interest wins then the others must lose. Negative 
correlation leads to a tighter distribution of wins, so this analysis based on independent 


101m practice, q, c, and r vary across races and even within races if there are multiple wagers. We approximate 
the sequence of wagers by using the average values of these parameters. 
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trials underestimates the statistical significance of the results. Since X is binomially 
distributed, the normal distribution approximates Equation (8) as 


Rene _ ng 
1-o| —— |, (9) 
| =| 


where ® is the cumulative distribution function of a standard N (0, 1) variable. 

For dual qualifiers and assuming an initial wealth of $2,500 each year, there were 
61 bets totaling $7,079, and of these bets, nine won for a gross return of $11,357 and a 
profit of $4,278. Thus, c = 7,079/61 = 116.0 and r = 11,357/9 = 1,262. If the system 
were no better than random betting, then q satisfies rg — c = —0.16c, recognizing the 
16% track take, giving g = 0.07725. Then, by Equation (9), the probability of profits of 
at least the observed level of $4,278 given random betting is 2.0%. Suppose instead that 
the system is better than random betting but only good enough to offer zero expected 
profits. Then q solves rq — c = 0, or q = 0.09192, and, by Equation (9), the probability 
that such a system would produce at least the observed profits is 6.7%. 

For dual-and-asterisk qualifiers, with initial wealth of $2,500 each year, there were 
107 bets totaling $13,268; 14 of these bets won for a gross return of $18,253 and a 
profit of $4,985. Thus, c = 124 and r = 1,304. If the system were no better than random 
betting, then g = 0.07988 and the probability of profits of at least the observed level 
is only 2.6%. Assuming instead that the system is better than random betting but only 
good enough to offer zero expected profits, then q = 0.09509 and the probability that 
such a system would produce at least the observed profits is 10.4%. 

The second approach used by BHZ to address the statistical significance of the results 
involved two simulations for each qualifier designation. The first simulation dealt with 
the question of how likely it would be that profits at the observed level would have 
been generated if their approach was vacuous and, therefore, was essentially random 
wagering. The second simulation asked how likely it would be that the observed profits 
would have been generated if the system was able to improve upon random wagering, 
but only enough to achieve zero expected return on each wager (excluding breakage). 
The algorithm for the first simulation was 


1. Start with a betting wealth of $2,500 in 1981. 

2. Determine the fraction of wealth to wager on each betting interest i for the current 
year based on the Kelly criterion and the (wrong) assumption that our probability 
estimate, p;, is correct. 

3. Randomly select the winner, with the probability of winning for betting interest 
i being qj. 

4. Based on the simulated winner, its payout and our wagers, update wealth. 

5. Repeat steps (2) to (4) for each year in order up to 2005. 


The second simulation differed only in step 3, where the simulation used q;/Q as 
the correct win probability for any betting interest i that received a wager in step 2. The 
expected return on every wager was zero, before accounting for breakage. The collective 


Chatper 23: The Dosage Breeding Theory for Horse Racing Predictions 543, 


Marshall Gramm and William T. Ziemba 327 


TABLE 5 Results from 10,000 Betting Simulations with $2,500 Initial Wealth 


Dual qualifier Dual-or-asterisk qualifier 
Simulation 1 Simulation? Simulation! Simulation 2 
Final wealth < $1,000 (%) 54.3 37.0 73.8 49.3 
Final wealth < $2,500 (%) 84.2 72.6 91.5 76.1 
Final wealth > system’s final 3.9 9.2 3l 11.4 
wealth (Table 4) (%) 

Mean final wealth $1,555 $2,449 $1,030 $2,473 
Median final wealth $892 $1,371 $450 $1,022 
Maximum final wealth $67,601 $161,777 $97,336 $409,294 


Source: Bain et al., 2006. 


win probability of the other betting interests was such that probabilities summed to 
one. For example, in a three-horse race with [g), q2, q3] = [0.42, 0.21, 0.37] and Kelly 
bets having been placed on the first two horses based on [ pi, p2, p3], the probability 
that the simulation would select each betting interest as the winner was [0.42, 0.21, 
0.37] for Simulation 1, and was [0.5, 0.25, 0.25] for Simulation 2 after dividing the first 
two fractions by 0.84. The simulations were run 10,000 times each. The results are in 
Table 5. 

For all simulations, losses occurred more than 70% of the time. For Simulation 2 
and for both types of qualifiers, the mean final wealth was close to $2,500, which is 
expected given the modification in step 3 of this simulation. For dual qualifiers, only 
3.9% of the time did Simulation 1 realize profits as high as our observed profit. For 
Simulation 2, the corresponding value is 9.2%. For dual-or-asterisk qualifiers, these 
values for Simulations 1 and 2 are, respectively, 3.1% and 11.4%. 

As a final test, the analyses were conducted using limited data. For example, if the 
year being considered was 1997 and the interval was 25 years, only information from 
1972 to 1996 would have been applied. For the dual-qualifer model, using the entire 
dataset produced the greatest final wealth; while for the dual-or-asterisk model, using 
an interval of 56 years produced a final wealth of $5,305. 


8. THE PREAKNESS STAKES, 1946-2006 


The Preakness Stakes, unlike the Kentucky Derby and Belmont Stakes, does not provide 
a new test of stamina for its competitors. The race is run two weeks after the Kentucky 
Derby and is 1/16 mi shorter in distance. The application of the dosage system would 
likely be negated by the fact that entrants in the Preakness exiting the Kentucky Derby 
have a quantifiable result of their ability to run longer distances. GZ find that dual quali- 
fiers and asterisk qualifiers outperform non-dual qualifiers and non-asterisk qualifiers 
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but have not historically earned a positive wagering return. Using a test for differences 
between two population proportions given in Equation (10), we are able to show that 
dual qualifiers win a statistically greater proportion of races than non-dual qualifiers 
and asterisk qualifiers win a statistically greater proportion of races than non-asterisk 
qualifiers. The test statistic is 


bee (Dag a Podq ) (10) 


where p is a proportion of winners from dual qualifier (dq), non-dual qualifiers (ndq), 
and the pooled sample and Maq is the number of dual qualifiers and naq non-dual 
qualifiers. The 116 dual qualifiers won 28 of 61 Preaknesses between 1946 and 2006. 
This translates into 24.1% (218/116) of dual qualifiers were winners versus only 7.5% 
(33/440) of non-dual qualifiers which is significantly different at a 1% level (z = 5.10). 
The results were similar for asterisk qualifiers (z = 5.36) with 22.0% winners (36/164) 
and non-asterisk qualifiers with 6.4% winners (25/392). 

While dual and asterisk qualifiers won the Preakness more than non-qualifiers, the 
wagering returns were negative for flat win bets on dual and asterisk qualifiers from 
1946 to 2006 (see Table B2 in the Appendix and Figure 6). Wagers on each of the 
116 dual qualifiers would yield a loss of $14.90 (—12.8%). The interval from 1981 to 
2006 which corresponds to the first Roman/Rasmussen publication on dosage in the 
Daily Racing Form, however, does show a net positive return of $2.80 (7.6%) for dual 
qualifiers (see Table B2 in the Appendix and Figure 7). Win wagers on asterisk qualifiers 
lose $39.80 (24.3%) from 1946 to 2006 and $17.10 (—10.4%) from 1981 to 2006. 


Preakness Stakes 
Wealth Level History for $1 Win Bets 
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FIGURE 6 Preakness wealth level history for $1 win bets, 1946-2006. 
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FIGURE 7 Preakness wealth level history for $1 win bets, 1981-2006. 
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FIGURE 8 Belmont Stakes wealth level history for $1 win bets, 1946-2006. 


9. THE BELMONT STAKES, 1946-2006 


The Belmont Stakes provides a unique test of stamina where competitors run a full 
1/4 mi farther than they have before. GZ find that the application of expert information 
from the dosage theory does result in a significant advantage. Table B3 and Figures 8 
and 9 give the wealth level histories for 1946-2006 and 1981-2006, respectively. Both 
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Belmont Stakes 
Wealth Level History for $1 Win Bets 


Wealth After Betting for Each Year 
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Year 
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FIGURE 9 Belmont Stakes wealth level history for $1 win bets. 1981-2006. 


demonstrate considerable statistically significant positive profits. Year by year dual and 
asterisk results for 1946-2006 have $1 flat win bets beginning in 1946 growing to result- 
ing in $105.55 (98.6%) and $164.85 (113.7%) in profit for dual and asterisk qualifiers, 
respectively. The results for 1981-2006 yield $80.85 (216.3%) and $67.95 (129.7%) in 
profit for dual and asterisk qualifiers, respectively. 

Using differences between two population proportions given earlier in Equation 
(10), gives that the proportion of dual qualifier winners is statistically greater than 
the proportion of non-dual qualifying winners at less than the 1% significance level 
(z = 6.73). Indeed, the dual qualifiers won 29.9% of the time vs. 6.8% for the non-dual 
qualifiers. We reject the hypothesis at Paq = Pydq at a level well below 1% signif- 
icance. The proportion of asterisk qualifier winners is also statistically greater than 
the proportion of non-asterisk qualifier winners at less than the 1% significance level 
(z = 6.26). The asterisk qualifiers won 25.5% of the time versus 6.2% for the 
non-asterisk qualifiers. 


10. CONCLUSIONS 


The racetrack is a useful financial market for testing market efficiency and considerable 
evidence exists in support of the track’s win market being weak-form efficient. This 
chapter, however, summarizes the work of BHZ and GZ that show the win market is not 
semi-strong efficient. 

BHZ and GZ focused on a particular aspect of the Kentucky Derby and Belmont 
Stakes, whose distances of 1⁄4 and 1'4 miles are typically farther than any entrant 
has ever raced, and the Preakness Stakes, which is run between these two races at 
ymi. This lack of direct evidence of an entrant’s stamina for this race has motivated 
the search for indirect evidence. Dosage theory, which analyzes a horse’s pedigree, has 
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been offered as such evidence but it has also been controversial, both in general and 
in its relation to the Kentucky Derby. Other evidence that has been offered includes 
well-publicized rankings of horses and results from recent high-caliber races. 

BHZ did not evaluate the criticisms or the justifications offered for the dosage con- 
cept and for the ranking of two-year-olds, nor did they attempt to refine their application 
to the Kentucky Derby. Instead, they simply merged this publicly available information 
with the public’s win odds to establish “adjusted” win probabilities. They then tested 
these win probabilities within a betting system based on the optimal capital growth 
model and showed statistically significant profits. 

GZ applied this procedure to the 14mi Belmont Stakes, which is run five weeks 
after the Kentucky Derby. From the 1980s to the mid-1990s when the dual qualifiers 
were having very good success in the Derby, GZ’s results in the Belmont were not as 
good. However, in recent years the situation has reversed with superior results in the 
Belmont than the Derby. The betting systems discussed here are two of many strategies 
used by bettors. In the '%. Preakness, the dosage breeding theory is less of a factor, 
which is understandable given that the race is shorter than the Kentucky Derby, and 
thus evidence of a horse’s stamina exists. Even so, a positive return on dual qualifiers 
exists from 1981-2006. The procedure outlined shows that given the pools from a set of 
races for which the strategy is applicable, the simple model given in Equation (6) can 
be used to test the validity of the strategy. 
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APPENDIX A: Data Sources 


A.1. Public’s Wagering 


For races from 1946 to 1991, betting data were taken from tables published in The 
Courier-Journal, a Louisville, Kentucky, newspaper, usually on the Sunday after the 
Kentucky Derby. The pools for 1970 could not be found. There were several discrep- 
ancies in the data for which the published pools did not sum to the totals, or did not 
correspond with published win odds. Adjustments were made for errors for which 
an apparent revision could be made. For 1999-2002, complete pools were obtained 
from the Bloodstock Research Information Services Website (http://www.bris.com). In 
2001, the pools also appeared on the Website for the home of the Kentucky Derby, 
Churchill Downs (http://www.churchilldowns.com). The 2003 pools were sent directly 
by Churchill Downs, and the 2004 and 2005 pools were obtained courtesy of John Swe- 
tye who obtained them from Philadelphia Park’s Phonebet service. From 1992 to 1998, 
the pools recorded in The Courier-Journal did not have all of the bets included. While 
these totals are not available, win odds based on total wagering are available. There- 
fore, for 1970 and 1992-1998 we estimated the total win pool and backed out a set 
of win pool fractions that are consistent with the published win odds. There were no 
dual qualifiers in 1998 and 2003 so those years were excluded from the dual-qualifier 
modeling. 


A.2. Pedigrees 


Pedigree information was taken from The Blood-Horse magazine, the American Pro- 
duce Records, a software database called “The Pedigree Program,” the pedigree 
query Website, http://owl.netscout.com/pedigree (no longer active), the Del Mar Turf 
Club Website, http://www.dmtc.com/dmtc98/Pedigree/, thoroughbred registries, and 
Roman’s Website, http://www.chef-de-race.com. The 2004 data were sent by personal 
communication from Roman to John Swetye who forwarded them to us. 


A.3. Chef-de-Race Listings 


Classifications of chefs were taken from the original 1981 list (Roman, 2000), the 
American Racing Manual for each year from 1986 (the first year that the list was 
included) to 1993, and from Roman’s Website. For the period 1981~1986, the 1981 
list was used. For years prior to 1981, the original list was used. For 2001 to 2003, 
and 2005, Dosage Indices and EFH rankings tabulated by Roman were taken from his 
Website, http://www.chef-de-race.com, and for 2004 they were sent via email from 
Roman (see Section A.2, above). 
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A.4, Experimental Free Handicap Listings 


The EFH listings were taken from the American Racing Manual, The Blood-Horse 
magazine (print and online at http://www.bloodhorse.com), the Thoroughbred Times 
Website, http://www.thoroughbredtimes.com, and Roman’s Website, http://www.chef- 
de-race.com. 


A.5. Results of the Kentucky Derby and Major Races Prior to the 
Kentucky Derby 


The results of the Kentucky Derby were taken from the Daily Racing Form, both 
print and online (http://www.drf.com), press materials from Churchill Downs, and from 
Chew (1974). Recent results charts were obtained from the following Websites: 


About.com Inc. http://horseracing.about.com 
Sportsline.com Inc. http://www.sportsline.com 
CNN/Sports Illustrated —_http://www.sportsillustrated.cnn.com 
Equibase Company, LLC __http://www.equibase.com 

Daily Racing Form, LLC http://www.drf.com 


The results for the major races prior to the Derby were taken from the American 
Racing Manual and lists from the following Websites: 


Blue Grass Stakes: http://www.keeneland.com/liveracing/history.asp 

Flamingo Stakes: http://hialeahpark.com/99/HallofFame/flamingo.htm 

Florida Derby: http://www.thoroughbredchampions.com/library/fladerby.htm 
Santa Anita Derby http://www.revistahipodromo.com/santaanita.html 


Wood Memorial Stakes _http://www.nyra.com/aqueduct/index2.html 


and the out-of-date site, http://www.iglou.com/tbred/tc97/preps, which was run by the 
Thoroughbred Times. 


APPENDIX B: Kentucky Derby, Preakness, and 
Belmont Winners, 1946-2006 


TABLE B1 Kentucky Derby, 1946-2006 


Dual 
Year Winner Odds qualifiers 
1946 Assault* 8.2 5 
1947 Jet Pilot 5.4 4 
1948 Citation 0.4 4 
1949 Ponder 16 5 
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TABLE B1 (continued) 


Winner Odds 
Count Turf 14.6 
Hill Gail 1.1 
Dark Star 24.9 
Determine 43 
Swaps* 2.8 
Needles 1.6 
Iron Leige 8.4 
Tim Tam* 2.1 
Tomy Lee 3.7 
Venetian Way 6.3 
Carry Back 2.5 
Decidedly 8.7 
Chateaugay* 9.4 
Northern Dancer 3.4 
Lucky Debonair* 43 
Kauai King 24 
Proud Clarion 30.1 
Forward Pass* 2.2 
Majestic Prince* 14 
Dust Commander* 15.3 
Canonero II 8.7 
Riva Ridge 1.5 
Secretariat 1.5 
Cannonade 1.5 
Foolish Pleasure 1.9 
Bold Forbes 3 
Seattle Slew 0.5 
Affirmed 1.8 
Spectacular Bid 0.6 
Genuine Risk 13.3 
Pleasant Colony 3.5 
Gato del Sol 21.2 
Sunny’s Halo 2.5 
Swale 34 
Spend A Buck 4.1 
Ferdinand 17.7 
Alysheba 8.4 


Dual 
qualifiers 


— — N WK Fw LEN UN DWN WN De UT FN HW WN LY 


U P NY WwW WwW ww NW WH RA A 


(continued) 
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TABLE B1 (continued) 


Dual 
Year Winner Odds qualfiers 
1988 Winning Colors * 3.4 3 
989 Sunday Silence* 3.1 3 
990 Unbridled 10.8 4 
991 Strike The Gold* 48 4 
992 Lil E. Tee 16.8 3 
993 Sea Hero 12.9 3 
994 Go For Gin 9.1 5 
995 Thunder Gulch 24.5 6 
996 Grindstone 5.9 6 
997 Silver Charm 4 2 
998 Real Quiet 8.4 0 
999 Charismatic 31.3 6 
2000 Fusaichi Pegasus* 23 5 
2001 Monarchos* 10.5 3 
2002 War Emblem 20.5 3 
2003 Funny Cide 12.8 0 
2004 Smarty Jones 4.1 5 
2005 Giacomo 50.3 2 
2006 Barbaro* 6.1 2 


NOTE: Bold indicates dual qualifier, * indicates 
asterisk qualifier. 


TABLE B2 Preakness Stakes, 1946-2006 


Dual qualifiers Asterisk qualifiers 

Return Return Return Return 

Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
946  Assault* 1.4 2 —$2.00 3 -$0.60 
1947 Faultless 42 4 —$0.80 5 —$0.40 
948 Citation 01 3 ~$2.70 3 —$2.30 
1949 Capot 2.5 2 —$1.20 3 —$1.80 
1950 Hill Prince 0.7 2 —$1.50 2 -$2.10 
1951 Bold 4.1 1 —$2.50 2 -$4.10 
1952 Blue Man* 1.6 3 —$5.50 4 -$5.50 
953 Native Dancer 0.2 1 —$5.30 2 -56.30 
1954 Hasty Road 5 2 —$1.30 3 -$3.30 
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TABLE B2 (continued) 


Dual qualifiers 


Asterisk qualifiers 


Year Winner Odds Number 
1955 Nashua 0.3 2 
1956 Fabius 2.5 l 
1957 Bold Ruler 1.4 3 
1958 Tim Tam* 1.1 2 
1959 Royal Orbit 6.6 4 
1960 Bally Ache 17 3 
1961 Carry Back 1 2 
1962 Greek Money 10.9 2 
1963 Candy Spots 1.5 2 
1964 Northern Dancer 2.1 4 
1965 Tom Rolfe 3.6 2 
1966 Kauai King 1 3 
1967 Damascus 1.8 3 
1968 Forward Pass* 1.1 0 
1969 Majestic Prince* 0.6 1 
1970 Personality* 4.5 3 
1971 Canonero II 3.4 3 
1972 Bee Bee Bee 18.7 2 
1973 Secretariat 0.3 1 
1974 Little Current 13.1 1 
1975 Master Derby 23.4 2 
1976  Elocutionist 10.1 4 
1977 Seattle Slew 0.4 1 
1978 = Affirmed 0.5 3 
1979 Spectacular Bid 0.1 3 
1980 Codex* 2.7 2 
1981 Pleasant Colony 15 1 
1982  Aloma’s Ruler 6.9 1 
1983 Deputed Testamony 14.5 0 
1984 Gate Dancer 4.8 1 
1985 Tank’s Prospect 47 0 
1986 Snow Chief 2.6 2 
1987 Alysheba 2 2 
1988 Risen Star 6.8 2 
1989 Sunday Silence* 2.1 2 
1990 Summer Squall 2.4 2 


Return 
46-06 


—$2.00 
—$3.00 
—$3.60 
$5.60 
~$2.00 
—$2.30 
—$2.30 
-$4.30 
—$3.80 
—$4,70 
—$2.10 
-$5.10 
—$8.10 
—$8.10 
-$9.10 
~$12.10 
—$15.10 
-$17.10 
-$16.80 
-$17.80 
—$19.80 
~$12.70 
—$12.30 
~$13.80 
-$15.70 
~$17.70 
~$16.20 
$17.20 
~$17.20 
—$18.20 
—$18.20 
—$20.20 
—$19.20 
~$21.20 
-$23.20 
—$21.80 


Return 
81-06 


Number 


Retum 
46-06 


Return 
81-06 


$1.50 

$0.50 

$0.50 
-$0.50 
-$0.50 
—$2.50 
~$1.50 
~$3.50 
~$5.50 
$4.10 


ww we A EN WN FF WN YW WwW FW WwW WwW WW U SK N 


HK ON = 


Ww WwW fF WwW FS 


—$4.00 
-$5.00 
—$5.60 
-$6.50 
-$3.90 
—$4.20 
-$5.20 
—$8.20 
—$8.70 
-$9.60 
—$8.00 
—$11.00 
—$14.00 
-$13.90 
—$15.30 
—$13.80 
-$17.80 
~$19.80 
~$21.50 
—-$23.50 
~$27.50 
-$20.40 
—$20.00 
—$21.50 
—$23.40 
—$22.70 
~$21.20 
—$23.20 
~$23.20 
—$24.20 
~$24.20 
~$28.20 
$28.20 
~$32.20 
-$32.10 
—$31.70 


(continued) 


$1.50 


-$0.50 
~$0.50 
-$1.50 
~$1.50 
-$5.50 
—$5.50 
—$9,50 
—$9,40 
~$9.00 
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TABLE B2 (continued) 


Dual qualifiers Asterisk qualifiers 
Retum Retum Return Return 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1991 Hansel 9.1 2 —$13.70 $4.00 2 —$23.60 -—$0.90 
1992 Pine Bluff 3.5 1 -$10.20 $7.50 1 —$20.10 $2.60 
1993 Prairie Bayou 2.2 1 -$11.20 $6.50 2 —$22.10 $0.60 
1994 Tabasco Cat 3.6 3 -$9.60 $8.10 3 —$20.50 $2.20 
1995 Timber Country 1.9 2 —$11.60 $6.10 3 -$23.50 -$0.80 
1996 Louis Quatorze 8.5 2 -$13.60 $4.10 3 —$26.50 -$3.80 
1997 Silver Charm 3.1 1 —$10.50 $7.20 3 -$25.40 -$2.70 
1998 Real Quiet 2.5 0 —$10.50 $7.20 1 -$26.40 -$3.70 
1999 Charismatic 8.4 2 -$12.50 $5.20 4 —$30.40 -$7.70 
2000 Red Bullet 6.2 2 —$14.50 $3.20 4 -$34.40 -$11.70 
2001 Point Given 2.3 3 —$14.20 $3.50 5 —$36.10 ~$13.40 
2002 War Emblem 2.8 0 -$14.20 $3.50 1 —$37.10 -$14.40 
2003 Funny Cide 1.9 0 —$14.20 $3.50 0 ~$37.10 -$14.40 
2004 Smarty Jones 0.7 1 ~$15.20 $2.50 1 —$38.10 -$15.40 
2005 Afleet Alex 3.3 3 —$13.90 $3.80 4 —$37.80 -$15.10 
2006 Bernardini 12.9 1 —$14.90 $2.80 2 —$39.80 -$17.10 
NOTE: Bold indicates dual qualifier, * indicates asterisk qualifier. 
TABLE B3 Belmont Stakes, 1946-2006 
Dual qualifiers Asterisk qualifiers 
~ Retum Retum Return Return” 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1946 Assault* 1.4 1 ~$1.00 2 $0.40 
1947 Phalanx 2.3 2 $0.30 2 $1.70 
1948 Citation 0.2 4 -$2.50 5 —$2.10 
1949 Capot 5.6 2 $2.10 4 $0.50 
1950 Middleground 2.7 3 $2.80 4 $0.20 
1951 Counterpoint 5.15 3 —$0.20 4 $2.35 
1952 One Count 12.8 2 -$2.20 3 —$0.65 
1953 Native Dancer 0.45 1 -$1.75 1 —$0.20 
1954 High Gun 3.45 2 -$3.75 3 -$3.20 
1955 Nashua 0.15 1 -$3.60 1 -$3.05 
1956 Needles 0.65 2 -$3.95 2 —$3.40 
1957 Gallant Man 0.95 1 —$4.95 1 —$4.40 
1958 Cavan 4.5 0 -$4.95 1 -$5.40 
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TABLE B3 (continued) 
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Dual qualifiers Asterisk qualifiers 

Return Return Return Return 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1959 Sword Dancer 1.65 3 —$5.30 4 —$6.75 
1960 Celtic Ash 8.4 2 —$7.30 2 —$8.75 
1961  Sherluck* 65.05 4 —$11.30 5 $52.30 
1962 Jaipur 2.85 1 —$8.45 2 $54.15 
1963 Chateaugay* 45 1 —$9.45 2 $57.65 
1964 Quadrangle 6.55 3 —$4.90 4 $61.20 
1965 Hail to All 2.65 1 -$5.90 1 $60.20 
1966 Amberoid SS 2 —$1.40 2 $64.70 
1967 Damascus 0.8 1 —$2.40 1 $63.70 
1968 Stage Door Johnny 44 1 —$3.40 2 $61.70 
1969 Arts and Letters 1.7 1 -$4.40 2 $59.70 
1970 High Echelon 4.5 2 -$0.90 2 $63.20 
1971 Pass Catcher 34.5 3 $31.60 3 $95.70 
1972 Riva Ridge 1.6 3 $31.20 3 $95.30 
1973 Secretariat 0.1 1 $31.30 3 $93.40 
1974 Little Current 1.5 1 $30.30 2 $91.40 
1975  Avatar* 13.2 2 $28.30 4 $101.60 
1976 Bold Forbes 0.9 1 $29.20 1 $102.50 
1977 Seattle Slew 0.4 1 $29.60 1 $102.90 
1978 Affirmed 0.6 3 $28.20 3 $101.50 
1979 Coastal 44 3 $25.20 3 $98.50 
1980 Temperence Hill 53.4 4 $21.20 5 $93.50 
1981 Summing 79 1 $20.20 —$1.00 l $92.50 —$1.00 
1982 Conquistador Cielo 4.1 1 $19.20 —$2.00 2 $90.50 —$3.00 
1983 Caveat 2.6 1 $21.80 $0.60 3 $91.10 —$2.40 
1984 Swale 1.5 1 $23.30 $2.10 1 $92.60 —$0.90 
1985 Crème Fraiche 2.5 1 $25.80 $4.60 1 $95.10 $1.60 
1986 Danzig Connection 8 2 $32.80 $11.60 2 $102.10 $8.60 
1987 Bet Twice 8 2 $39.80 $18.60 3 $108.10 $14.60 
1988 Risen Star 2.1 1 $38.80 $17.60 3 $105.10 $11.60 
1989 Easy Goer 1.6 3 $38.40 $17.20 4 $103.70 $10.20 
1990 Go And Go 7.5 2 $36.40 $15.20 3 $100.70 $7.20 
1991 Hansel 4.1 3 $38.50 $17.30 3 $102.80 $9.30 
1992 A.P. Indy 1.1 2 $38.60 $17.40 2 $102.90 $9.40 
1993 Colonial Affair 13.9 2 $36.60 $15.40 3 $99.90 $6.40 
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TABLE B3 (continued) 


Dual qualifiers Asterisk qualifiers 

Return Return Retum Return 
Year Winner Odds Number 46-06 81-06 Number 46-06 81-06 
1994 Tabasco Cat 3.4 2 $39.00 $17.80 2 $102.30 $8.80 
1995 Thunder Gulch 1.5 1 $40.50 $19.30 1 $103.80 $10.30 
1996 Editor’s Note 5.8 3 $44.30 $23.10 4 $106.60 $13.10 
1997 Touch Gold 2.65 1 $43.30 $22.10 2 $104.60 $11.10 
1998 Victory Gallop 4.5 1 $42.30 $21.10 2 $102.60 $9.10 
1999 Lemon Drop Kid 29.75 3 $70.05 $48.85 4 $129.35 $35.85 
2000 Commendable 18.8 0 $70.05 $48.85 0 $129.35 $35.85 
2001 Point Given 1.35 3 $69.40 $48.20 4 $127.70 $34.20 
2002 = Sarava 70.25 0 $69.40 $48.20 0 $127.70 $34.20 
2003 Empire Maker* 2 0 $69.40 $48.20 l $129.70 $36.20 
2004 Birdstone 36 1 $105.40 $84.20 l $165.70 $72.20 
2005 Afleet Alex 1.15 2 $105.55 $84.35 2 $165.85 $72.35 
2006 Jazil 6.2 0 $105.55 $84.35 l $164.85 $71.35 


NOTE: Bold indicates dual qualifier, * indicates asterisk qualifier. Crème Fraiche was coupled with dual 
qualifier Stephan’s Odyssey. 
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ABSTRACT The Kentucky Derby features top three-year-old thoroughbred horses. Run at 1 2 miles, it is 
typically at least 1/8 mile longer than any of the horses has raced before. This extra distance, usually 
combined with a large field, makes the race a difficult test of stamina for horses this young. Bettors, because 
there is no direct evidence of whether a horse has the stamina to compete effectively at 1 2 miles, are also 
challenged. The informational content of one publicly available, pedigree-based measure of stamina, the 
Dosage Index, is used with simple performance measures to identify a semi-strong-form inefficiency, and 
to create a betting scheme based on the optimal capital growth model that merges these criteria with the 
public’s opinion. Statistically significant profits, net of transaction costs, could have been achieved during 
the period 1981 to 2005. 


Key WorDs: Semi-strong market efficiency, capital growth theory, speculative investments, sports betting 


1. Introduction 


The Kentucky Derby annually gathers many of the top three-year-old thoroughbred horses at 
Churchill Downs in Louisville Kentucky on the first Saturday in May. For the horses entered, 
the race is a new challenge since its distance of 1} miles is typically at least 1/8 mile longer 
than any of them have ever raced. The extra distance of the Kentucky Derby, usually combined 
with a large field that includes many top-flight contenders, presents a significant test of stamina 
for these young horses. For the Kentucky Derby, the uncertainty about each horse’s stamina 
increases the difficulty for the public of establishing accurate win odds. This potential source of 
semi-strong-form inefficiency in the Kentucky Derby win market is the focus of this paper. 

Roberts (1967) defined a market as being ‘weak-form’, ‘semi-strong-form’ or ‘strong-form’ 
efficient if it is not possible to devise a profitable investment scheme net of transaction costs 
based on prices (or, for the racetrack, publicly available odds), based on all publicly available 
information, or based on all information, respectively. For traditional financial markets, there is 
considerable evidence that points to weak-form and semi-strong-form efficiency, but little evidence 
for strong-form efficiency (see Fama, 1970, 1991 and Keim and Ziemba, 2000 for surveys). 
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Weak-form efficiency of the racetrack’s win market means that betting systems based solely 
on the public’s win odds, established through pari-mutuel betting, are not profitable. Evidence 
from many tracks over many years has pointed to weak-form efficiency (Ali, 1977; Asch et al., 
1982).! Weak-form efficiency of the win market is a consequence of four of its features. First, 
transaction costs are high, about 13-20%, depending on track location, so a bettor needs to be 
considerably more successful than the average bettor just to break even.? Second, while the 
challenge is substantial, the concept of the win bet is relatively simple. Thus, bettors have no 
confusion about their task. Third, many racetrack bettors approach their wagering very seriously 
and some are very sophisticated. Fourth, for this serious audience, there is usually an abundance 
of relevant information, including records of past performances and workouts for all the horses, 
breeding, earnings, jockey records, etc. 

For the Kentucky Derby, the first two of these criteria are satisfied for the win market. While 
the third criterion is met, the Kentucky Derby also receives much more interest in North America 
from casual fans than any other race. Because, typically, none of the Derby entrants has raced at 
1 l miles, it can be argued that the fourth criterion is not fully met. 

The main objective of this paper is to determine whether the informational content of one 
particular pedigree-based measure of stamina that is publicly available, namely the Dosage Index, 
in conjunction with simple performance measures, is captured in the pari-mutuel win odds and, 
if not, whether they can be used to develop a profitable betting scheme. 

The operation of the racetrack market is discussed in the following section. Section 3 describes 
the Dosage Index and performance measures, and their application to the Kentucky Derby. The 
data used in the analysis are discussed in Section 4. Section 5 develops a scheme for estimating 
each betting interest’s win probability based on the public’s odds, the Dosage Index and the 
performance measures. The betting model is described and analysed in Section 6, and conclusions 
are given in Section 7. 


2. The Racetrack as a Sequence of Markets 


Prior to a race, bettors engage in markets that establish prices for the various betting opportunities 
for that race. Betting closes immediately before the race begins, and payouts are calculated 
immediately following the race for single race bets. We focus on the market for win betting. 
Suppose there are N betting interests in a race. Let W; be the total amount bet to win on betting 
interest i = 1,..., N. The total win pool for the race is 


w= wW.. (1) 


The ‘track payback’ Q (generally 0.80 to 0.87) is the fraction of each dollar bet that is returned 
to the bettors. The commission or ‘track take’ is 1 — Q. If betting interest k wins the race, then 
win bets on betting interests i Æ k return zero, while each dollar bet on betting interest k returns 
approximately QW / W,. The actual profit per dollar is rounded down to the nearest nickel or dime 
(this is called ‘breakage’). Together the track take and breakage constitute the transaction costs.* 

Typically, each horse in a race runs as a separate betting interest. However, two or more horses in 
arace that have common ownership typically run as a single betting interest known as an ‘entry’. In 
addition, in a race where there would be more betting interests than a preset maximum, the horses 
with the least-impressive credentials are grouped as a single betting interest known as the ‘Field’. 
A bet to win on an entry or the Field pays off if any member of that betting interest wins the race. 
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Both have been common in the Kentucky Derby. However, the long-time regulations in Kentucky 
changed in 2001, so there were no entries and no Field in the Kentucky Derby from 2001 on. 


3. The Dosage Index and Performance Measures 


The fact that usually no Derby entrant has raced at li miles prior to the race has led to the 
search for relevant information from alternative sources, including the horse’s pedigree. One 
method of evaluating a thoroughbred’s pedigree, commonly known as Dosage Theory, has its 
roots in the work of French cavalry officer Lt.-Col. Jean-Joseph Vuillier, who studied the pedigrees 
of exceptional thoroughbreds of the later 18th to the early 20th century (Vuillier (Pseudonym 
Lottery), 1902, 1906, 1928). The concept of thoroughbred dosage evolved through Varola (1974, 
1980), whose patented classification of prominent stallions according to the type of offspring that 
they produced first appeared in a series of articles in The British Racehorse. 

Roman’s (1981) modifications of Varola’s work are known as Dosage Theory. His work was 
outlined in Leon Rasmussen’s Bloodlines column in the Daily Racing Form beginning before 
the 1981 Kentucky Derby. One product of Roman’s pedigree analysis is the Dosage Index (DI), 
which is based on the categorization of prominent stallions in terms of whether they consistently 
sire offspring with distance proficiencies that are incongruous with the dosage profiles of those 
offspring when that stallion is excluded. Classified stallions are called chefs-de-race (or simply 
chefs)>; see Ziemba and Hausch (1987), Roman’s Web site, http: //www.chef-de-race.com for the 
rationale behind the selection of recent chefs, and Roman (2002). 

There are five categories for chef classification in Roman’s system: Brilliant, Intermediate, 
Classic, Solid and Professional. The categorization is based on ‘where they (sires) must lie on 
the speed-stamina spectrum to bring the figures of their descendants back in line with those of 
horses in the general population exhibiting similar performance traits’ (Roman, 2001). A chef 
can be placed in one or two categories. Each time a chef appears in a four-generation pedigree, 
points are awarded in the appropriate category. Points are assigned on a scale of 16 for the first- 
generation sire, 8 for each second-generation sire, 4 for each third-generation sire, and 2 for each 
fourth-generation sire. Sires that are classified in two categories have their points split. After the 
fifteen sires have been assigned points, the total for each category is entered into the Dosage Index 
formula 

_ Brilliant + Intermediate + 1/2 Classic 
Solid + Professional + 1/2 Classic 


It is evident that horses with a high D/ have a pedigree that is weighted towards Brilliant and 
Intermediate chefs, that is, sires who tend to produce offspring with greater sprinting ability than 
their pedigrees would suggest if that sire was eliminated from the pedigree. Horses with a low 
DI are predicted to have stamina. Very seldom will a stakes-quality horse have no dosage points, 
though some have so few that the D/ is unreliable. The dosage profile for 1997 Kentucky Derby 
winner Silver Charm is shown in Table 1. 

After the initial classification of chefs in 1981, Roman found that no Kentucky Derby winner 
from 1940 to 1980 had a DI exceeding 4.0, despite about 1 in 7 entrants having a DJ that high 
(over the interval 1946—1980 considered here). 

The Dosage Index is not a direct measure of the quality of a horse. One quality measure is 
the ‘Experimental Free Handicap’ (EFH), an annual ranking of two-year-old thoroughbreds that 
raced in select races in the USA (see http://www.jockeyclub.com/experimental.asp). Conducted 
since 1933 by the Jockey Club, the EFH assigns the top runners a figurative weight on a scale 
that usually has the two-year-old champion weighted at 126 pounds.® Exceptional horses have 
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Table 1. Dosage Index calculation for 1997 Kentucky Derby winner Silver Charm 


Generation Sire Brilliant Intermediate Classic Solid Professional 


1 Silver Buck 
Buckpasser 8 
Poker 
3 Tom Fool 2 2 
Hail to Reason 4 
Round Table 4 
Wise Margin 
4 Menow 
War Admiral 2 
Turn-to 1 l 
Mahmoud l 1 
Princequillo 1 1 
Nasrullah 2 
Market Wise 
Faultless 


Total 3 > 17 5 0 


Dosage Index = (3 + 5 + 17/2)/(5 + 0 + 17/2) = 1.22. 


been weighted up to 132 pounds. Other top horses are assigned lower weights based on perceived 
ability until a cutoff is reached at about 100 pounds beyond which no more are classified. Usually 
there are 15 to 30 horses classified within 10 pounds of the top-weighted horse. Roman (1981) 
observed that starting in 1972 most Kentucky Derby winners were rated within 10 pounds of the 
top-weighted horse. This observation led to the designation ‘dual qualifier’ for any horse that was 
weighted within 10 pounds of the top-weighted horse on the EFH (indicating the quality of the 
horse) and had a D7 less than or equal to 4.0.7 

Professional handicapper James Quinn offered a second measure of quality to add late- 
developers to the list. He defined what we call an ‘asterisk qualifier’ to be any horse that: (1) won 
at least one of a selection of premier races prior to the Kentucky Derby; (2) had a DI less than or 
equal to 4.0; and (3) was not rated within 10 pounds on the EFH. A horse is a ‘dual-or-asterisk’ 
qualifier if it qualifies for one of these two categories. 

Our objective was neither to judge these measures nor to refine them. Instead, our objective 
was to study whether any predictive power there may be in these widely publicized measures 
was incorporated into the public’s pari-mutuel win odds. If not fully incorporated, then a further 
objective was to investigate whether these measures could be used to determine win probability 
estimates that are sufficiently superior to the public’s so that a profitable wagering scheme based 
on win betting could be developed, despite the significant transaction costs. 


4. Data Acquisition 


This section discusses the nature of the data, while the sources of the data are described in the 
Appendix. 

The public’s win betting pool and results were collected for the period 1946 to 2005. For 52 of 
these years, dollar amounts that the public wagered were found. As a result, for these other eight 
years we did not have the exact fractions that the public wagered on each horse. However, it was 
possible to back out fractions that were consistent with these odds. 
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The Experimental Free Handicap listing and pedigree information for each Derby participant 
were collected for each year from 1946 to 2005. The original list of chefs was published in 1981. 
For years prior to 1981, this list was used, which means that the classification of chefs for 1946 
to 1980 is not completely out of sample. However, the hypothetical betting begins in 1981, so all 
betting is based on lists of chefs that were out of sample. For the period 1981-1986, the 1981 list 
was used (see Appendix for explanation). After 1986, an updated list of chefs was used each year. 

The major races for asterisk-qualifier status, with their 2005 graded stakes classification and 
the years that they have been run over the interval 1946-2005, were the Blue Grass Stakes 
(G1) (1946-2005), Flamingo Stakes (currently not run) (1946-1989, 1992-2001), Florida Derby 
(G1) (1952-2005), Santa Anita Derby (G1) (1946-2005) and the Wood Memorial Stakes (G1) 
(1946-2005). The Flamingo Stakes declined in importance before being cancelled, but was 
included because historically it was an important prep race. 


5. Application of Breeding Information and Performance Measures to Refine Estimated 
Win Probabilities 


Two models were developed for estimating win probabilities that depended on whether a betting 
interest was a dual qualifier or a dual-or-asterisk qualifier. 

The 1995 Kentucky Derby is used in Table 2 to illustrate the required information. Most of the 
possibilities in terms of qualifying are presented in Table 2. Also evident is a complication with 
regard to accounting for pedigree with entries (and the Field): the horses in an entry may not have 
the same qualifier status. This difficulty was handled using the following scheme: 


1. If all members of an entry had the same qualifier status, then the entry was considered as one 
horse with that qualification. 

2. If one member of the entry was a dual qualifier plus had won any of the designated major races 
prior to the Kentucky Derby, the entry was considered to be a dual qualifier regardless of the 
qualifications of the other member(s) (based on the presumption that in most cases most of the 
public’s attention on the entry was due to that horse). 

3. If the members of an entry did not all have the same qualifier status, but each was either a dual 
qualifier or an asterisk qualifier, then the entry was viewed as a dual-or-asterisk qualifier. 

4. In all other cases the entry was considered to be neutral, i.e. neither a qualifier nor not a 
qualifier. 


The qualifier status of the Field was determined in the same manner. For the dual-qualifier model 
there are 67, 0, 10 and 22 betting interests in the respective four categories, and for the dual-or- 
asterisk-qualifier model there are 57, 2, 10 and 30 betting interests in the respective categories. 

With respect to the dual-qualifier model, of the winners there are 29 that are qualifiers, 26 that 
are not qualifiers, and 3 that are part of a neutral entry. With respect to the dual-or-asterisk-qualifier 
model, of the winners there are 41 qualifiers, 16 that are not qualifiers, and 3 that are part of a 
neutral entry. In 1998 and 2003 there were no dual qualifiers so those years were ignored in the 
dual-qualifier modeling. 

We began our modeling with a base-case model that related a betting interest’s win probability 
to the public’s wagering to see if looking solely at the pools without the ‘expert information’ could 
lead to a profitable betting scheme. 

Let W/ be the public’s win bet on betting interest i, W/ be the win pool in race j, and N/ be 
the number of betting interests in race j. For race j, define p} to be the probability that betting 
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Table 2. Sample input data: 1995 Kentucky Derby field 


Qualifier status 


Horse W/W Entry EFH Won specified race DI Dual Asterisk 
Jambalaya Jazz 0.044 115 1.15 

Pyramid Peak 1 - Flamingo 3.00 ° 
Serena’s Song 0.189 122 2.11 ° 

Timber Country 2 126 3.29 e 

Mecke 0.066 Field 107 4.50 

Knockadoon Field - 3.57 

Citadeed Field - 1.60 

In Character Field - 1.77 

Ski Captain Field — 3.67 

Lake George Field = 4.50 

Thunder Gulch 0.033 116 Florida Derby 4.00 ° 

Tejano Run 0.087 121 2.38 e 

Jumron 0.126 115 3.80 

Eltish 0.070 123 3.00 e 

Afternoon Deelites 0.086 124 5.00 

Suave Prospect 0.059 113 4.60 

Talkin Man 0.167 114 Wood Memorial 3.00 . 
Dazzling Falls 0.029 111 6.20 

Wild Syn 0.042 — Blue Grass 4.33 

Wi/ W Post-time fraction of win pool. 

Entry Entry number or Field. 

EFH Experimental Free Handicap weight. Blank implies not weighted. (High weight 


for two-year-olds from 1994 was 126 pounds.) 
Won specified race Winner of a major race prior to Kentucky Derby. 
DI Dosage Index: see Equation 2. 
Qualifier status e implies meets qualifier requirements. 


interest 7 wins and define qj = wj / W! to be the fraction of the win pool bet on betting interest i. 
For this base case, the following model was used for each race 


dn Oa (3) 

Bae (qn)? 
If 6 is 1 then pi equals qi ; 

We used a standard maximum-likelihood approach to estimate the optimal values of 6. Consider 
R independent races and define K = (k,,..., kr) to be an R-tuple representing the winners of 
the R races, i.e. k; is the number of the betting interest that won race j. Let Pi, represent the 
estimated probability based on Equation 3, evaluated before race j, that betting interest k; wins 
race j. The probability that the vector K corresponds to the winners of the R races is 


R 
P(K|8) = | | pi,- (4) 
j=l 
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Transforming Equation 4 into a likelihood function that depends on 6 gives 


R 
LIK) x [| p}. (5) 


j=l 


A maximum-likelihood point estimate for 5, namely 5,4, can be found by maximizing the like- 
lihood as a function of 5. Our first value for dy, was calculated using the first 10 years of data, 
namely 1946-1955 inclusive. Thereafter, the value of ôm; was updated for each year using data 
from 1946 to that year. The win pool fraction for the winner and values for 5y, calculated after 
each year’s race are shown in Figure 1. 

The values for ôm; are less than 1.0 for the years prior to 1974. This is a consequence of the 
public’s more favored betting interests winning less often during this period than would have 
been expected based on the public’s odds. The winners from 1972 to 1979 were dominated by 
favorites, culminating with Spectacular Bid in 1979, so dy, increases over this interval, reaching 
a maximum value of 1.12. During the period 1980-2005, the public’s favorite seldom won, so 
ôm, tends to decrease to it final value 0.92.8 

With values of 5y;, close to 1.0, Equation 3 generates revised win probabilities that differ 
only slightly from the fraction of the win pool. The greatest ratio p/q; over the interval 
1981-2005 is 1.12. This 12% edge is insufficient to offer a positive expected return on a win bet 
after accounting for the transaction costs; hence this simple model points to weak-form efficiency 
of the win market over this period. 
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Figure 1. Value for ôm after each year’s race and the fraction of the win pool bet on the winner, 1955-2005 
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Our main objective was to use this same procedure to create models that modified the win 
probability for each betting interest based on whether or not it was considered a dual qualifier or 
considered a dual-or-asterisk qualifier. 

For this case, the probability of betting interest i winning is 


Fic ET 
wr Gh) 


The variable y equals œ if betting interest k was a dual qualifier (or dual-or-asterisk quali- 
fier depending on the test), 8 if it was classified as not a dual qualifier (or not a dual-or-asterisk 
qualifier if applicable), and 1 if the betting interest was an entry or Field classified as being neutral. 

Based on Equation 6, maximum-likelihood values for a and £, denoted as ay, and mz, were 
calculated each year. The initial estimate was to predict for 1956, using the first ten years of data 
(1946-1955). Figure 2 illustrates the progression of ay, and By, values for the dual-qualifier 
model, and Figure 3 shows ay, and By, values for the dual-or-asterisk-qualifier model. 

In these two figures, the critical pattern is the relative magnitude of œm, and mz. In Figure 2, 
æm, usually exceeds By, until after the 1980 race. Consequently, for this period, the revised 
win probability for each dual qualifier is usually less than or not significantly greater than the 
fraction of the money bet on it in the win pool (from 1956 to 1980 inclusive, only four ratios 
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Figure 2. Values for «mz and By, for dual-qualifier model after each year’s race, 1955-2005 
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Figure 3. Value for œ mz and fmz, for dual-or-asterisk qualifier model after each year’s race, 1955-2005 


of revised win probability to win pool fraction are greater than 1.1 of which only one is greater 
than 1.15). This implies that betting on dual qualifiers, if they had been known, would not have 
been advantageous during that period. In the mid-1970s, dual qualifiers began to win consistently, 
eventually leading to By, exceeding œm, for the remainder of the study period. From 1983 on, 
the revised win probabilities for dual qualifiers exceed their fraction of the win pool. Figure 3 
for dual-or-asterisk qualifiers shows a similar pattern, although By, begins to exceed ay, after 
only three years. Thus, from 1959 on, the model predicts win probabilities for dual-or-asterisk 
qualifiers that exceed their fraction of the win pool over 91% of the time. For example, the original 
and revised estimates of win probabilities for 1995 are in Table 3. 

In Table 3 a betting interest’s estimated win probability rises if it meets the qualifier criterion. 
(This is not necessarily the case if there are many qualifiers because the sum of the probabilities 
is unity.) The effect of considering asterisk qualifiers is demonstrated by Talkin Man, an asterisk 
qualifier but not a dual qualifier. His estimated win probability varies from 0.114 to 0.204 
depending on the criterion used. 

The revised win probability estimates are occasionally sufficiently greater than the fraction of 
the win pool to allow a positive expected return even considering transaction costs. 

The percentage increase in the win probability over the fraction of the win pool for Thunder 
Gulch is much greater than for Entry 2. There is a general tendency for p;/q; to increase for 
qualifiers as q; decreases, which is a consequence of the power function model in Equation 6 
together with values of a < 6 anda < 1. 
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Table 3. Original and revised estimated win probabilities for the 1995 Kentucky Derby 


Betting Interest W/W DQ PDQ DAQ PDAQ 
Entry 1 0.044 -1 0.029 0 0.027 
Entry 2 0.189 1 0.242 1 0.220 
Field 0.066 -1 0.044 -1 0.035 
Thunder Gulch 0.033 1 0.075 1 0.076 
Tejano Run 0.087 1 0.144 1 0.137 
Jumron 0.126 -=l 0.085 -i1 0.068 
Eltish 0.070 1 0.125 1 0.121 
Afternoon Deelites 0.086 -1 0.057 -! 0.046 
Suave Prospect 0.059 -1 0.039 -1 0.031 
Talkin Man 0.167 -1 0.114 1 0.204 
Dazzling Falls 0.029 -1 0.019 -1 0.015 


Wild Syn 0.042 -1 0.027 -1 0.022 


W/W fraction of win pool bet on the betting interest. 


DQ indicator is 1 for dual qualifiers, O for neutral entries and —1 for non-qualifiers. 

PDQ revised estimated win probability based on dual-qualifier mode). 

DAQ indicator is | for dual-or-asterisk qualifiers, 0 for neutral entries and —1 for non-qualifiers, 
PDAQ revised estimated win probability based on dual-or-asterisk qualifier model. 


6. The Betting Model 


Betting amounts were determined using the optimal capital growth model (OCGM) which 
maximizes the expected logarithm of wealth on a race-by-race basis. This approach was developed 
by Kelly (1956) (and is commonly called the ‘Kelly criterion’), and was extended and rigorously 
proved by Breiman (1961). Among its properties are: (1) it maximizes the asymptotic growth rate 
of wealth; (2) it asymptotically minimizes the expected time to reach any specific sufficiently 
large wealth level; and (3) in the long run it outperforms any other essentially different betting 
strategy almost surely and provides infinitely more final wealth than any other essentially different 
strategy. (See MacLean et al., 1992, 2006; Rotando and Thorp, 1992; and Thorp, 2006 for further 
properties, and Ziemba and Hausch, 1986 for simulation results for shorter time horizons.) 

The revised probability of betting interest i winning based on dual-qualifier or dual-or-asterisk- 
qualifier status is p;. Let r; be the gross return per dollar bet based on the win odds established 
by the public. (As in Section 2, we are suppressing the superscript indicating the race number.) 
The OCGM requires solving the following optimization problem for each race 


m=) i=] 


N N N 
imi il 1- p ii t. f >0Yi=1,..., N d ;<1. 7 
miine DP af Ds + fr) s.t. fi i an Ds (7) 


The decision variable f; is the fraction of the current wealth to bet on betting interest i. Suppose 
that the betior’s initial wealth is w and betting interest i wins. Then w fir; is returned to the bettor 
after having invested w ie Fm, for a final wealth of w(1 — yei Ím + firi). The objective 
function determines the logarithm of final wealth for each betting interest winning, weighted by 
the probability of that betting interest winning. The actual initial wealth, w, can be disregarded 
in the formulation by having the decision variables be the fraction of wealth that is bet on each 
betting interest. The constraints comprise a budget constraint and non-negativity.° 

This formulation assumes that the bets are sufficiently small so that they do not infiuence the 
payout on any betting interest, i.e. the bets on betting interest i do not reduce r;. For the Kentucky 
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Derby, the win betting pool is so large that a typical bet is very unlikely to influence payouts.!° 
The large pools also permit the assumption that the percentage bet on each betting interest varies 
little in the final few minutes. 

Revised probabilities for the base-case model based on Equations 3 and 5 are sufficiently close 
to the public’s win probabilities that expected returns are negative on all betting interests over the 
period 1981-2005. Solving Equation 7 with these revised probabilities leads to no bets. 

For the models based on Equation 6 and on status as a dual or dual-or-asterisk qualifier, the 
betting started with an initial wealth of $2,500 in 1981. It was updated after each year based on 
the bets made and the actual outcome of the race. The wealth history for betting over 1981 to 
2005 is shown in Figure 4 for dual qualifiers and for dual-or-asterisk qualifiers. Overall results are 
summarized in Table 4. For the years up to the mid-1980s any advantage identified by the model 
is small and results in small bets. Comparing the values in Figure 4 with those in Figures 2 and 
3 shows that wy, and By, are relatively close over that interval. As the model predicts a greater 
advantage, the amount per bet, and consequently the volatility, grows. 

For comparison, betting $2,500 on the favorite to win from 1981 to 2005 would yield a loss of 
$41,500; betting $200 to win on each dual qualifier would yield a profit of $12,920 on $13,200 
bet; and a $200 bet to win on each dual-or-asterisk qualifier would yield a profit of $7,780 on 
$23,000 bet (neutral entries excluded as qualifiers). The improved return on investment compared 
to the OCGM in the short run is helped considerably by a few huge profits on qualifiers Gato Del 
Sol (1982), Ferdinand (1986), and Thunder Gulch (1995). 
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Figure 4. Wealth level history for Kelly win bets, 1981-2005 
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Table 4. Profits from both models based on Kelly bets and revised 
win probabilities 


Model based on qualifier type 


Dual Dual-or-asterisk 

Number of bets 61 107 
Total amount bet $32,828 $66,467 
Number of bet that won 9 14 
Initial wealth $2,500 $2,500 
Final wealth $5,514 $4,889 
Total profit $3,014 $2,389 
Percentage return on investment 9.2 3.6 


For both models the betting scheme produced profits over the 1980s and up to the mid-1990s, 


but the performance has been poor since. Several possibilities can be considered for this: 


. The sample space is too small. This could mean that the sequence of successes for qualifiers 


for both models from 1972 to 1997 was a short-term run, so that in the long run there is nothing 
to be gained from using either model developed here. 

The limited sample space also implies that the final wealth is sensitive to individual race 
results. To give an idea of the scope, two extreme examples are (i) if a non-dual-qualifier had 
won in 1995, instead of Thunder Gulch, the final wealth for the dual-qualifier model would be 
$2,221, i.e. a slight loss overall, and (ii) if 2005 winner Giacomo had a favorable change of a 
single dosage point in any category, he would have been a dual qualifier and the final wealth 
would have been $13,877 for the dual-qualifier model. 


. It is extremely difficult to make a proper assessment of all of the two-year-olds, so the EFH 


can omit suitable horses. A prominent example occurred in 2003 where eventual 2004 Derby 
winner Smarty Jones was not rated on the EFH. Yet, he had overwhelmed a field of state- 
bred two-year-olds in a stakes race at Philadelphia Park, but that race is not counted when 
determining the EFH. Roman (200Sa) lists other Derby winners such as Winning Colors and 
Sunday Silence who were superior two-year-olds but were not rated on the EFH. 


. Classification of chefs is an ongoing exercise. For example, Alydar was classified as a chef 


subsequent to Strike The Gold winning the Derby in 1991, so Strike The Gold, who won the 
Blue Grass Stakes, is not considered as an asterisk qualifier here (DI = 9.00), yet when Alydar 
was classified, Strike The Gold's DI was reduced to 2.60. Ziemba (1991) wrote a column about 
this prior to the 1991 Derby arguing that Alydar should be a classic chef as he had numerous 
classic distance winners. The classification of Alydar, and at what point, would possibly change 
other dosage indices. However, here we go with Roman’s classification so Strike the Gold is 
not an asterisk qualifier. 


. Roman (2005b) has pointed out the gradual rise in the DZ of Derby winners over time, so the 


failure of the system in the last few years of the study may reflect a shift of the overall breed 
in North America towards speed at shorter distances. Real Quiet (1998), Charismatic (1999) 
and Giacomo (2005) all had D/ values greater than 4.00. 


. The Flamingo Stakes decreased in significance in the final years that it was run. Including the 


Flamingo winner as an asterisk qualifier in recent years was unwarranted in retrospect. Two 
possible solutions are to drop the Flamingo at some point in the analysis, or switch to the 
Arkansas Derby as the fifth significant prep race. 
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Random betting generates expected losses in excess of 16%, due to the 16% track take plus 
breakage. Our results showed both qualifier designations approximately doubling wealth over the 
betting period. However, in light of the variation in wealth displayed in Figure 4, it is important to 
address the statistical significance of these profits. We do so with two approaches. The first treats a 
betting interest’s win or loss as a binomial random variable and then uses a Normal approximation. 
The second approach simulates the set of races assuming random wagering. 

Before considering the first approach, observe that the data in Figure 4 are not ideal for address- 
ing the statistical significance. Wealth generally grows until the mid-to-late 1990s, and then 
dramatically falls. This pattern of wins and losses leads to wealth that is highly variable. Focusing 
on just the dual-qualifier case, Figure 5 superimposes on Figure 4 the wealth level history assum- 
ing that the races were run in reverse order, i.e. we started in 2005 with $2,500, then updated 
wealth based on our results in 2005 and went to the 2004 race, and so on. Thus, the string of large 
losses occurs early with lower wealth, after which wins are more common. The final wealth is 
identical, since the optimal capital growth model simply determines the optimal fraction of wealth 
to bet each year. (For example, losing 10% one year and gaining 20% the other year leads to an 
overall return of 8%, whatever the order of the win and the loss.) Despite the final wealth being 
the same, the wealth histories are very different, as is the appearance of any statistical significance 
to the profits. 

Our goal is to assess the profitability of this system. To eliminate the effect of varying wealth, 
which can dampen or intensify the variance in profits, our test of statistical significance uses bets 


Wealth After Betting for Each Year ($) 


1980 1985 1990 1995 2000 2005 2010 


Figure 5. Betting wealth for Kelly win bets on dual qualifiers with races run forward (1981-2005) and run 
backwards (2005-1981) 
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and returns in each race assuming an identical initial wealth each race. We do not update wealth 
year by year (as in Figure 4); instead the initial wealth each year is assumed to be $2,500., 

Let q be the probability of winning a bet in each trial, n be the number of trials, c be the amount 
wagered each trial, and r be the gross return upon winning,!! and let X be the random variable 
representing the number of wins. The probability of profits exceeding a constant x is 


P[rX —nce>}. (8) 


Assume that the trials are independent. For bets in different races, this assumption is reasonable. 
For multiple bets on the same race — which are common -— wins are negatively correlated since 
if one betting interest wins then the others must lose. Negative correlation leads to a tighter 
distribution of wins, so in this way our analysis based on independent trials will underestimate the 
statistical significance of our results. Since X is binomially distributed, the Normal distribution 
approximates equation 8 as 


1-0[ ae (9) 


vnq(l — q) 


where ® is the cumulative distribution function of a standard N(0,1) variable. 

For dual qualifiers and assuming an initial wealth of $2,500 each year, there were 61 bets totaling 
$7,079, and of these bets, 9 won for a gross return of $11,357 and a profit of $4,278. Thus, 
c = 7,079/61 = 116.0 and r = 11, 357/9 = 1,262. If the system were no better than random 
betting, then q satisfies rq — c = —0.16c, recognizing the 16% track take, giving q = 0.07725. 
Then, by Equation 9, the probability of profits of at least the observed level of $4,278 given 
random betting is 2.0%. Suppose instead that the system is better than random betting but only 
good enough to offer zero expected profits. Then q solves rq — c = 0, or q = 0.09192, and, by 
Equation 9, the probability that such a system would produce at least the observed profits is 6.7%. 

For dual-and-asterisk qualifiers, with initial wealth of $2,500 each year, there were 107 bets 
totaling $13,268; 14 of these bets won for a gross return of $18,253 and a profit of $4,985. Thus, 
c = 124 andr = 1,304. If the system were no better than random betting, then q = 0.07988 and 
the probability of profits of at least the observed level is only 2.6%. Assuming instead that the 
system is better than random betting but only good enough to offer zero expected profits, then 
q = 0.09509 and the probability that such a system would produce at least the observed profits 
is 10.4%. 

Our second approach to addressing the statistical significance of the results involved two simu- 
lations for each qualifier designation. The first simulation dealt with the question of how likely it 
would be that profits at the observed level would have been generated if our approach is vacuous 
and, therefore, is essentially nothing beyond random wagering. The second simulation asked how 
likely it would be that the observed profits would have been generated if the system is able to 
improve upon random wagering, but only enough to achieve zero expected return on each wager 
(excluding breakage). The algorithm for the first simulation was 


1. Start with a betting wealth of $2,500 in 1981. 

2. Determine the fraction of wealth to wager on each betting interest i for the current year based 
on the Kelly criterion and the (wrong) assumption that our probability estimate, p;, is correct. 

. Randomly select the winner, with the probability of winning for betting interest i being q;. 

. Based on the simulated winner, its payout and our wagers, update wealth. 

. Repeat steps 2 to 4 for each year in order up to 2005. 


Un & Ww 
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Table 5. Results from 10,000 betting simulations with $2,500 initial wealth 


Dual qualifier Dual-or-asterisk qualifier 


Simulation 1 Simulation 2 Simulation 1 Simulation 2 


Final wealth < $1,000 (%) 54.3 37.0 73.8 49.3 
Final wealth < $2,500 (%) 84.2 72.6 91.5 76.1 
Final wealth > System’s 

Final wealth (Table 4) (%) 3.9 9.2 3.1 11.4 
Mean final wealth $1,555 $2,449 $1,030 $2,473 
Median final wealth $892 $1,371 $450 $1,022 


Maximum final wealth $67,601 $161,777 $97,336 $409,294 


The second simulation differed only in step 3, where the simulation used q;/ Q as the correct 
win probability for any betting interest i that received a wager in step 2. The expected return on 
every wager was zero, before accounting for breakage. The collective win probability of the other 
betting interests was such that probabilities summed to unity. For example, in a three-horse race 
with [q1, 42, 93] = [0.42, 0.21, 0.37] and Kelly bets having been placed on the first two horses 
based on [p;, p2, p3], the probability that the simulation would select each betting interest as the 
winner was [0.42, 0.21, 0.37] for Simulation 1, and was [0.5, 0.25, 0.25] for Simulation 2 after 
dividing the first two fractions by the track payback of 0.84. The simulations were run 10,000 
times each. The results are shown in Table 5. 

For all simulations, losses occurred more than 70% of the time. For Simulation 2 and for 
both types of qualifiers, the mean final wealth was close to $2,500, which is expected given the 
modification in step 3 of this simulation. For dual qualifiers, only 3.9% of the time did Simulation 
1 realize profits as high as our observed profit. For Simulation 2, the corresponding value is 
9.2%. For dual-or-asterisk qualifiers, these values for Simulations 1 and 2 are, respectively, 3.1% 
and 11.4%. 

As a final test the analyses were conducted using limited data. For example, if the year being 
considered was 1997 and the interval was 25 years, only information from 1972 to 1996 would 
have been applied. For the dual-qualifer model, using the entire data set produced the greatest 
final wealth; while for the dual-or-asterisk model, using an interval of 56 years produced a final 
wealth of $5,305. 


7. Conclusions 


The racetrack is a useful financial market for testing market efficiency and considerable evidence 
exists, including results in this paper, in support of the track’s win market being weak-form 
efficient, i.e. no profitable system can be developed based on the odds established by the public. 
This paper tests whether the win market is also semi-strong efficient, i.e. no profitable system can 
be developed based on the public’s odds and other publicized information. 

We focus on a particular aspect of the Kentucky Derby, which is that the Derby’s distance of 
14 miles is typically farther than any entrant has ever raced. This lack of direct evidence of an 
entrant’s stamina for this race has motivated the search for indirect evidence. Dosage Theory, which 
analyses a horse’s pedigree, has been offered as such evidence but it has also been controversial, 
both in general and in its relation to the Kentucky Derby. Other evidence that has been offered 
includes well-publicized rankings of horses and results from recent high-caliber races. 
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Our goal has not been to evaluate the criticisms or the justifications offered for the dosage 
concept and for the ranking of two-year-olds, nor do we attempt to refine their application to 
the Kentucky Derby. Instead, we developed a model that takes this information, which is readily 
available to the public and receives much attention, and merges it with the public’s win odds to 
establish win probabilities. We then tested these win probabilities within a betting system based 
on the optimal capital growth model and showed statistically significant profits. 

A specific application of the procedure would be for the 15-mile Belmont Stakes which is run 
weeks after the Kentucky Derby. This analysis is the planned focus of future work. It is known 
though from our preliminary analysis that during the period in the 1980s to mid-1990s when 
the dual qualifiers were having very good success in the Derby their results in the Belmont were 
not as good. However, in recent years the situation has reversed with much better results in the 
Belmont than the Derby. In a more general context, the betting systems detailed here are two of 
many ‘angles’ used by bettors. The procedure outlined shows that given the pools from a set of 
races for which the angle is applicable, the simple model given in Equation 6 can be used to test 
the efficiency of the market with respect to the angle. 
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Notes 


An exception may be extreme favorites at odds of 3-10 or less, which, while quite rare, have been shown to produce 
a small average profit (Ziemba and Hausch, 1986). Inefficiencies in other more complex markets are more common; 
see Hausch et al. (1994) for such evidence. 

Large bettors can reduce this take by betting at rebate sites that return a portion of the bet to make the actual take 
about 10%. We do not deal with such bettors here nor with those outside the USA who wager on Betfair or other 
betting exchanges against other bettors directly rather than in a pari-mutuel pool as discussed here. 

Our notation deals with one race only. In Section 5, to deal with several races simultaneously, we will add a superscript 
to our notation to identify the race number. 

For each track there is a minimum payout, usually 5%, that the track must return even if there are insufficient funds 
available in QW. 

Mares are not included because they are considered to have too few offspring to identify distance proficiencies, while 
it is not unusual for a stallion to sire over 100 offspring in a year. 

Ranking horses by weight is a familiar concept at the racetrack. In handicap races, the top horses carry greater weight 
(jockey + saddle + additional weights if necessary) than the less-qualified horses. Handicapping of this sort occurs 
only in select races and is intended to make the race more competitive. 

Some people expand the dual qualifier category to include any horse that is declared a champion in a country other 
than the USA and has a D/ less than or equal to 4.0. In this paper, only the definition as given. 

Griffith (1949), McGlothlin (1956), Ali (1977) Asch et al. (1982) and Ziemba and Hausch (1986), among others, have 
demonstrated that the public’s wagering has a strong and stable bias of underbetting the favorites and overbetting the 
longshots. This results in ô > 1.0. Ziemba and Hausch (1987) provided evidence that this ‘favorite-longshot bias’ is 
exhibited at the Kentucky Derby but it is weaker, that is more flat, than in these earlier studies. The recent advent of 
rebate and betting exchange wagering has led to a flattening of the favorite-longshot bias in recent data since about 
1998 (Ziemba, 2004). 

The solution for Equation 7 was obtained using the Fortran package DONLP2 written by Peter Spellucci which is 
available from NetLib (http: //www.netlib.org). 

See Hausch et al. (1981) for a formulation that does account for the bettor’s effect on payouts. 
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1 In practice, q, c, and r vary across races and even within races if there are multiple wagers. We approximate the 
sequence of wagers by using the average values of these parameters. 
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Appendix: Data Sources 
(1) Public’s Wagering 


For races from 1946 to 1991, betting data were taken from tables published in The Courier-Journal, 
a Louisville Kentucky newspaper, usually on the Sunday after the Kentucky Derby. The pools for 
1970 could not be found. There were several discrepancies in the data for which the published pools 
did not sum to the totals, or did not correspond with published win odds. Adjustments were made 
for errors for which an apparent revision could be made. For 1999-2002, complete pools were 
obtained from the Bloodstock Research Information Services Web site http://www.bris.com. In 
2001, the pools also appeared on the Web site for the home of the Kentucky Derby, Churchill 
Downs, http: //www.churchilldowns.com. The 2003 pools were sent directly by Churchill Downs, 
and the 2004 and 2005 pools were obtained courtesy of John Swetye who obtained them from 
Philadelphia Park’s (Bensalem Pennsylvania) Phonebet service. From 1992 to 1998, the pools 
recorded in The Courier-Journal did not have all of the bets included. While these totals are not 
available, win odds based on total wagering are available. Therefore, for 1970 and 1992-1998 we 
estimated the total win pool and backed out a set of win pool fractions that are consistent with the 
published win odds. There were no dual qualifiers in 1998 and 2003 so those years were excluded 
from the dual-qualifier modeling. 


(2) Pedigrees 


Pedigree information was taken from The Blood-Horse magazine, the American Produce 
Records, a software database called The Pedigree Program, the pedigree query Web site 
http://owl.netscout.com/pedigree (no longer active), the Del Mar Turf Club Web site 
http: //www.dmtc.com/dmtc98/Pedigree/, thoroughbred registries, and Roman’s Web site 
http: //www.chef-de-race.com. The 2004 data were sent by personal communication from Roman 
to John Swetye who forwarded them to us. 


(3) Chef-de-Race Listings 


Classifications of chefs were taken from the original 1981 list (Roman, 2000), the American 
Racing Manual for each year from 1986 (the first year that the list was included) to 1998, and 
from Roman’s Web site. For the period 1981—1986, the 1981 list was used. For years prior to 
1981, the original list was used. For 2000 to 2003, and 2005, Dosage Indices and EFH rankings 
tabulated by Roman were taken from his Web site, http://www.chef-de-race.com, and for 2004 
they were sent via e-mail from Roman (see (2)). 
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(4) Experimental Free Handicap Listings 


The EFH listings were taken from the American Racing Manual, The Blood-Horse magazine 
(print and on line http://www.bloodhorse.com), the Thoroughbred Times Web site http:// 
www.thoroughbredtimes.com, and Roman’s Web site http://www.chef-de-race.com. 


(5) Historical Results of the Kentucky Derby and Major Races Prior to the Kentucky Derby 


The results of the Kentucky Derby were taken from the Daily Racing Form, both print and on line 
(http://www.drf.com), press materials from Churchill Downs, and from Chew (1974). Recent 
results charts were obtained from the following Web sites: 


About.com Inc. http: //horseracing.about.com 
Sportsline.com Inc. http: //www.sportsline.com 
CNN/Sports Illustrated http: //sportsillustrated.cnn.com 
Equibase Company, LLC __http://www.equibase.com 
Daily Racing Form, LLC _http://www.drf.com 


The results for the major races prior to the Derby were taken from the American Racing Manual 
and lists from the following Web sites: 


Blue Grass Stakes http: //www.keeneland.com/liveracing/history.asp 

Flamingo Stakes http://hialeahpark.com/99/HallofFame/flamingo.htm 

Florida Derby http://www.thoroughbredchampions.com/library /fladerby.htm 
Santa Anita Derby http: //www.revistahipodromo.com/santaanita.html 


Wood Memorial Stakes —_http://www.nyra.com/aqueduct/index2.html 


and from the Thoroughbred Times current web site http://www.thoroughbredtimes.com and 
their out-of-date site http://www.iglou.com/tbred/tc97/preps. The sites for Hialeah Park and 
the Venezuelan site revistahipodromo.com are out of date now. 
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